public-law / open-gov-crawlers
Parse government documents into well formed JSON
☆64Updated 5 months ago
Related projects: ⓘ
- Web scraping Page Objects core library☆93Updated 2 months ago
- Save an RSS or ATOM feed to a SQLite database☆46Updated last year
- Web grep: search all rendered resources used by a URI☆83Updated 2 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆51Updated 3 weeks ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- Zyte Automatic Extraction integration for Scrapy☆55Updated 2 years ago
- Scrape various open data directories to create an index of what's available out there☆29Updated this week
- Add website scraping abilities to Datasette☆59Updated last year
- A financial disclosure data extraction tool.☆13Updated last year
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆31Updated last year
- A helper library full of URL-related heuristics.☆56Updated 2 weeks ago
- Python client for Zyte API☆19Updated 3 months ago
- Python clients for Zyte AutoExtract API☆39Updated 2 years ago
- A repository demonstrating the use of real-estate-scrape to store the estimated value of a property on Redfin and Zillow every night usin…☆28Updated this week
- Scrapy rotation proxy package with advanced functions☆92Updated 2 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆15Updated last week
- Common interface for data container classes☆61Updated last month
- Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of the…☆36Updated last month
- Scrapfly Python SDK for headless browsers and proxy rotation☆30Updated last week
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated last year
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.☆66Updated this week
- 🕶 Awesome list of Scrapy tools and libraries☆54Updated 4 years ago
- ☆29Updated 3 years ago
- Extract networks of entities from journalistic reporting☆46Updated last year
- Page Object pattern for Scrapy☆119Updated 2 months ago
- Datasette plugin providing data dashboards from metadata☆137Updated last week
- A Python client for the People Data Labs API☆24Updated this week
- Software stack with latest Scrapy and updated deps☆60Updated this week
- JavaScript support and proxy rotation for Scrapy with ScrapingBee.☆34Updated 4 months ago
- A modern Python library for writing maintainable web scrapers.☆244Updated 2 months ago