edx / pa11ycrawler
Python crawler (using Scrapy) that uses Pa11y to check accessibility of pages as it crawls.
☆18 Updated 6 years ago
Alternatives and similar repositories for pa11ycrawler
Users who are interested in pa11ycrawler are comparing it to the libraries listed below.
- A price comparison engine built with Django, Scrapy ☆11 Updated 9 years ago
- Scrapy downloader middleware that stores response HTML to disk. ☆18 Updated 3 weeks ago
- A Python library to detect and extract listing data from HTML pages. ☆108 Updated 8 years ago
- API to extract a list of keywords from a text. ☆18 Updated 8 years ago
- ScraperWiki Python library for scraping and saving data; in maintenance mode ☆158 Updated this week
- An online sentiment analyzer built with Flask and TextBlob ☆15 Updated 12 years ago
- Extensions for using Scrapy on Amazon AWS ☆32 Updated 13 years ago
- Scrape email addresses from a user-provided domain ☆20 Updated 7 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆99 Updated 3 years ago
- Site Hound (previously THH) is a Domain Discovery Tool ☆23 Updated this week
- Python scripts for scraping bus ticket data from the websites of BoltBus, Greyhound, Megabus, GoBus, Amtrak, Peterpan, and EasternTravel. ☆39 Updated 5 years ago
- Sometimes sites make crawling hard. Selenium-crawler uses selenium automation to fix that. ☆126 Updated 12 years ago
- Template for creating a scraper that saves to Google Sheets, fires Slack notifications, and is scheduled using AWS Lambda and CloudWatch ☆10 Updated 7 years ago
- Python module to watch Twitter user pages or search results. ☆64 Updated 11 years ago
- Python bot that crawls your website looking for dead stuff ☆43 Updated 3 years ago
- Simple Web UI for Scrapy spider management via Scrapyd ☆50 Updated 7 years ago
- Akvo Really Simple Reporting ☆40 Updated 3 months ago
- A modular template for scraping data from the web to send yourself scheduled email reports ☆41 Updated 5 years ago
- Scrapy middleware that allows crawling only new content ☆79 Updated this week
- Speed up your Localization / Internationalization efforts by automating translation with a single script ☆27 Updated 8 years ago
- Simple-to-use Python library for Buffer App ☆23 Updated 3 years ago
- An example using Selenium webdrivers for Python and the Scrapy framework to create a web scraper to crawl an ASP site ☆128 Updated 6 years ago
- Suite of tools for detecting changes in web pages and their rendering ☆55 Updated 2 years ago
- A Scrapy extension to store request and response information in a storage service ☆27 Updated 3 years ago
- Virtual patent marking crawler at iproduct.epfl.ch ☆15 Updated 8 years ago
- Code and data belonging to our CSCW 2019 paper: "Dark Patterns at Scale: Findings from a Crawl of 11K Shopping Websites". ☆136 Updated 6 years ago
- Big Five personality traits: domains, aspects, facets ☆25 Updated 9 months ago
- General Architecture for Text Engineering ☆49 Updated 9 years ago
- Small set of utilities to simplify writing Scrapy spiders. ☆49 Updated 10 years ago
- Tools for tracking stories on news homepages ☆48 Updated 6 years ago