edx / pa11ycrawler
Python crawler (using Scrapy) that uses Pa11y to check accessibility of pages as it crawls.
☆18 Updated 5 years ago
Alternatives and similar repositories for pa11ycrawler:
Users who are interested in pa11ycrawler are comparing it to the libraries listed below.
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python. ☆25 Updated 12 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi… ☆48 Updated 3 years ago
- Scrapy pipeline which allows you to store scrapy items in a solr server. ☆19 Updated 8 years ago
- Small set of utilities to simplify writing Scrapy spiders. ☆49 Updated 9 years ago
- Twitter crawler ☆11 Updated 10 years ago
- A scrapy extension to store requests and responses information in storage service ☆26 Updated 3 years ago
- An online sentiment analyzer built with Flask and TextBlob ☆15 Updated 11 years ago
- Pipeline for distributed Natural Language Processing, made in Python ☆64 Updated 8 years ago
- Extract data from an HTML table and store results to a csv file. ☆38 Updated 9 years ago
- This is a REST Server endpoint built using Flask and Python. ☆24 Updated 2 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas… ☆12 Updated 11 years ago
- Automated NLP sentiment predictions - batteries included, or use your own data ☆18 Updated 7 years ago
- Scrapy downloader middleware that stores response HTMLs to disk. ☆18 Updated 11 months ago
- Find which links on a web page are pagination links ☆29 Updated 8 years ago
- Documentation website for Storex ☆17 Updated 2 years ago
- legacy backend for Open States ☆87 Updated 5 years ago
- An OpenCalais API Interface for Python. ☆20 Updated 13 years ago
- Personal Knowledge Management System. Capture your ideas using plain old text files. Make a journal that lasts 100 years. ☆28 Updated last year
- a Simple API for RDF ☆29 Updated 15 years ago
- extract difference between two html pages ☆32 Updated 6 years ago
- a set of services that provide NLP facilities