public-law / open-gov-crawlers
Parse government documents into well-formed JSON
☆67 Updated last week
Alternatives and similar repositories for open-gov-crawlers:
Users who are interested in open-gov-crawlers are comparing it to the libraries listed below.
- Reading legal authority for the last time ☆34 Updated this week
- World legal info: scraped, organized, and permissively licensed under Creative Commons. ☆16 Updated last year
- Scrape various open data directories to create an index of what's available out there ☆36 Updated last week
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t… ☆33 Updated last year
- A helper library full of URL-related heuristics. ☆64 Updated 4 months ago
- Save an RSS or ATOM feed to a SQLite database ☆47 Updated 2 years ago
- Zyte Automatic Extraction integration for Scrapy ☆56 Updated 3 years ago
- JavaScript support and proxy rotation for Scrapy with ScrapingBee. ☆38 Updated 9 months ago
- Python clients for Zyte AutoExtract API ☆40 Updated 3 years ago
- Web scraping Page Objects core library ☆96 Updated last week
- Add website scraping abilities to Datasette ☆62 Updated last year
- Library for scraping websites or apis at any scale ☆53 Updated last year
- Common interface for data container classes ☆66 Updated last week
- Extract text from HTML ☆133 Updated 4 years ago
- Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of the… ☆36 Updated 6 months ago
- A financial disclosure data extraction tool. ☆13 Updated last year
- Software stack with latest Scrapy and updated deps ☆63 Updated last week
- Scrapy middleware which allows to crawl only new content ☆80 Updated 2 years ago
- A repository demonstrating the use of real-estate-scrape to store the estimated value of a property on Redfin and Zillow every night usin… ☆30 Updated this week
- News API - fetch news from CommonCrawl, parse with NewsPlease, enrich with pre-trained machine-learning models, to structured searchable … ☆28 Updated 2 years ago
- Paginating the web ☆37 Updated 11 years ago
- A microservice for document conversion at scale ☆62 Updated 2 weeks ago
- Scrape HN to track links from specific domains ☆52 Updated this week
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆55 Updated 2 months ago
- Open States data model and scraper backend ☆25 Updated this week
- Find rss, atom, xml, and rdf feeds on webpages ☆30 Updated 4 months ago
- A middleware layer for Scrapy that detects CAPTCHA tests and solves them ☆45 Updated last year
- Parser for U.S. federal regulations and other regulatory information ☆39 Updated last year
- Zyte API integration for Scrapy ☆37 Updated this week