edx / pa11ycrawler
Python crawler (using Scrapy) that uses Pa11y to check accessibility of pages as it crawls.
☆18 Updated 6 years ago
Alternatives and similar repositories for pa11ycrawler
Users who are interested in pa11ycrawler are comparing it to the libraries listed below.
- Template for creating a scraper that saves to Google Sheets, fires Slack notifications, and is scheduled using AWS Lambda and CloudWatch ☆10 Updated 7 years ago
- Python bot that crawls your website looking for dead stuff ☆43 Updated 3 years ago
- Speed up your Localization / Internationalization efforts by automating translation with a single script ☆27 Updated 8 years ago
- A web app that uses logarithmic regression to predict the outcome of tennis matches. Built with Python's Scikit-learn package and Flask ☆11 Updated 11 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆99 Updated 3 years ago
- Python scripts for scraping bus ticket data from the websites of BoltBus, Greyhound, Megabus, GoBus, Amtrak, Peterpan, and EasternTravel. ☆39 Updated 5 years ago
- An online sentiment analyzer built with Flask and TextBlob ☆15 Updated 12 years ago
- Automatically extracts and normalizes an online article or blog post publication date ☆118 Updated 2 years ago
- A Python library to detect and extract listing data from HTML pages. ☆108 Updated 8 years ago
- An online annotation platform for teaching and learning in the humanities. ☆108 Updated last week
- API - extract a list of keywords from a text. ☆18 Updated 8 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆58 Updated 2 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs). ☆93 Updated 3 months ago
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h… ☆192 Updated 4 years ago
- Parse Popolo JSON data and navigate it with Python ☆15 Updated 6 years ago
- Scrapy project with spiders to extract article content from various German news sites ☆21 Updated 12 years ago
- Web scraping and automation using Python ☆61 Updated 5 years ago
- A Python client for Chrome's DevTools protocol / a headless chrome control library ☆15 Updated 7 years ago
- ☆65 Updated 6 years ago
- A component that tries to avoid downloading duplicate content ☆27 Updated last week
- A modular template for scraping data from the web to send yourself scheduled email reports ☆41 Updated 5 years ago
- Big Five personality traits: domains, aspects, facets ☆25 Updated 9 months ago
- Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh. ☆10 Updated 7 months ago
- Scrape tables from Wikipedia articles into CSVs ☆75 Updated 5 years ago
- A trend viewer written in Python/JavaScript ☆21 Updated last year
- Find which links on a web page are pagination links ☆29 Updated 9 years ago
- ScraperWiki Python library for scraping and saving data; in maintenance mode ☆158 Updated last week
- Demo of the Newspaper article extraction library. ☆29 Updated 11 years ago
- ☆59 Updated 4 years ago
- A curated list of ways to take Awesome Website Screenshots ☆91 Updated 3 years ago