uscensusbureau / SABLE
Scraping Assisted by Learning
☆35 Updated this week
Alternatives and similar repositories for SABLE:
Users who are interested in SABLE are comparing it to the libraries listed below.
- A selection of business datasets ☆18 Updated 5 years ago
- Jupyter notebook + Code for reproducing Reddit Subreddit graphs ☆17 Updated 8 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- A financial disclosure data extraction tool. ☆14 Updated last year
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values … ☆15 Updated 6 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data. ☆13 Updated 3 weeks ago
- Since 2002, the Mexican Federal government handles most of its procurement biddings through a transactional platform called Compranet. E… ☆11 Updated 7 years ago
- ☆12 Updated 5 years ago
- Train a neural network optimized for generating Reddit subreddit posts ☆28 Updated 6 years ago
- Ontology dataset for open_numbers namespace ☆10 Updated 4 months ago
- ☆12 Updated last week
- A maximum-strength name parser for record linkage. ☆36 Updated last month
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly ☆47 Updated 4 years ago
- Techniques for Scraping the Web in Python ☆26 Updated 6 years ago
- A processing pipeline for JSON-formatted Tweet data, such as that returned by Twitter APIs. ☆12 Updated 7 years ago
- 📒 Analyzing Data, the DataMade Way ☆37 Updated 4 years ago
- This project is a wrapper for Leilex, a legal entity identifier API. Includes ISIN-LEI conversion and LEI number search by company name. ☆24 Updated 5 months ago
- Parse Popolo JSON data and navigate it with Python ☆15 Updated 5 years ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google" ☆34 Updated 4 years ago
- A curated list of awesome data sources related to elections, electoral reforms, and democratic political systems. ☆74 Updated 3 years ago
- A search engine for Open Data ☆53 Updated 2 years ago
- An API wrapper for Throne.AI ☆12 Updated 7 years ago
- Automated data extraction from U.S. state Comprehensive Annual Financial Reports (CAFR). ☆16 Updated 3 years ago
- Predict age and gender from a first name ☆60 Updated 6 years ago
- Interactive and searchable House staffer directory, based on House disbursement data. ☆27 Updated last year
- Uses NLP methods to parse and classify contracts from The City of New Orleans ☆10 Updated 10 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆56 Updated last year
- https://www.washingtonpost.com/graphics/2020/investigations/helicopter-protests-washington-dc-national-guard/ ☆23 Updated 4 years ago
- Open Data 500 ☆22 Updated 7 years ago
- A browser user interface for manual labeling of record pairs. ☆45 Updated last year