uscensusbureau / SABLELinks
Scraping Assisted by Learning
☆35Updated last month
Alternatives and similar repositories for SABLE
Users that are interested in SABLE are comparing it to the libraries listed below
Sorting:
- Using NLP to find and extract specific information from long, unstructured documents☆15Updated 7 years ago
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h…☆188Updated 4 years ago
- Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles☆23Updated 6 years ago
- The shared repository for Media Cloud web apps (Explorer, Source Manager, Topic Mapper)☆65Updated last year
- A search engine for Open Data☆53Updated 2 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- A web application that identifies party in political discourse and an example of operationalized machine learning.☆28Updated 6 years ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated 8 months ago
- A selection of business datasets☆18Updated 5 years ago
- Examples for getting started using https://case.law☆66Updated 2 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- A maximum-strength name parser for record linkage.☆37Updated 3 weeks ago
- GraphiPy: Universal Social Data Extractor☆84Updated 2 years ago
- Techniques for Scraping the Web in Python☆25Updated 7 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated 4 months ago
- The core of sunlightlabs' Data Commons project. Includes the Transparency Data site and the APIs that power TransparencyData.com and Infl…☆38Updated 8 years ago
- Inspect a URL and estimate if it contains a news story☆39Updated 7 months ago
- Scrapes sites. Gets news. Eventually events.☆87Updated 9 years ago
- This repository explores various Numpy commands which are quite useful for working with datasets and handling array operations.☆13Updated 6 years ago
- An automated, programming-free web scraper for interactive sites☆111Updated 2 years ago
- A financial disclosure data extraction tool.☆16Updated last year
- Binary Python bindings for poppler utils for content extraction☆42Updated 4 years ago
- Now included in rigour☆151Updated 2 months ago
- Source real estate prices from the Common Crawl.☆27Updated 6 years ago
- legacy backend for Open States☆87Updated 5 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- Train a neural network optimized for generating Reddit subreddit posts☆28Updated 7 years ago
- ☆72Updated 6 months ago
- Python wrapper for a C++ Double Metaphone☆15Updated this week