uscensusbureau / SABLE
Scraping Assisted by Learning
☆35 Updated 2 months ago
Alternatives and similar repositories for SABLE
Users who are interested in SABLE are comparing it to the repositories listed below.
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data. ☆13 Updated 5 months ago
- API client for Aleph, supports bulk entity and document upload. ☆28 Updated 9 months ago
- An automated, programming-free web scraper for interactive sites ☆111 Updated 2 years ago
- Tracking the history of the FARA data from https://www.justice.gov/nsd-fara ☆15 Updated last year
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h… ☆189 Updated 4 years ago
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis. ☆79 Updated 2 years ago
- A search engine for Open Data ☆55 Updated 2 years ago
- Examples for getting started using https://case.law ☆66 Updated 2 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆57 Updated last year
- The CorpWatch API uses automated parsers to extract the subsidiary relationship information from Exhibit 21 of companies' 10-K filings wi… ☆48 Updated 6 months ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google" ☆34 Updated 5 years ago
- Source real estate prices from the Common Crawl. ☆27 Updated 6 years ago
- Scrapes sites. Gets news. Eventually events. ☆88 Updated 9 years ago
- Our officially supported Python client library for accessing News API. ☆37 Updated 6 years ago
- Now included in rigour ☆151 Updated this week
- A maximum-strength name parser for record linkage. ☆37 Updated last month
- Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles ☆23 Updated 6 years ago
- Python 3.x notebooks about real-world data cleaning and visualization ☆72 Updated 9 years ago
- Python client for the Center for Responsive Politics API at OpenSecrets.org. ☆43 Updated 5 years ago
- Train a neural network optimized for generating Reddit subreddit posts ☆28 Updated 7 years ago
- Scrapes Google Trends data over long timescales and stitches together for daily data ☆72 Updated 5 years ago
- Ontology dataset for open_numbers namespace ☆10 Updated 8 months ago
- Techniques for Scraping the Web in Python ☆25 Updated 7 years ago
- Tutorials for getting the most out of Twitter data. ☆105 Updated 2 years ago
- Virtual patent marking crawler at iproduct.epfl.ch ☆14 Updated 7 years ago
- Python package for data.world ☆101 Updated last year
- framework for scraping legislative/government data ☆86 Updated 10 months ago
- 📚 Doing all sorts of things, the DataMade way ☆99 Updated 4 months ago
- Various Jupyter notebooks about Common Crawl data ☆55 Updated 4 months ago
- A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API. ☆118 Updated 5 years ago