sunlightlabs / name-cleaver
Parser and standardizer for politician, individual and organization names.
☆128Updated 7 years ago
Related projects: ⓘ
- A Python library for downloading, parsing and cleaning Federal Election Commission filings.☆27Updated 7 months ago
- A repository of journalist's lookup tables.☆103Updated 7 years ago
- search document dumps: ingest and explore in one extensible framework☆124Updated 4 years ago
- ☆13Updated this week
- ☆8Updated 8 years ago
- An experiment to standardize individual donor names in campaign finance data using simple graph theory and machine learning.☆63Updated 11 years ago
- ☆36Updated 7 years ago
- ☆37Updated this week
- Twitter, quick. Fetch and store tweets on short notice.☆80Updated 7 years ago
- Scraping, parsing and indexing the daily Congressional Record to support phrase search over time, and by legislator and date☆121Updated 2 years ago
- A simple Python library/tool for pulling location information from unstructured text☆184Updated 13 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- Tools for tracking stories on news homepages☆48Updated 4 years ago
- Unified Python bindings for Sunlight APIs☆66Updated 8 years ago
- Schemas to convert common fixed-width file formats into CSV using in2csv.☆123Updated 3 years ago
- Code for Newslynx App☆22Updated 8 years ago
- A deprecated Python wrapper for the DocumentCloud API☆64Updated 3 years ago
- The core of sunlightlabs' Data Commons project. Includes the Transparency Data site and the APIs that power TransparencyData.com and Infl…☆38Updated 7 years ago
- A working parser for the US Code's hierarchy, and a work-in-progress parser for the full content.☆104Updated 10 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆24Updated 3 years ago
- legacy backend for Open States☆87Updated 4 years ago
- Tools for text tokenization and encoding☆84Updated 2 years ago
- Collecting reports from Inspectors General across the US federal government.☆107Updated 3 years ago
- Command-line tool for exploring the PAC donor-recipient relationship☆54Updated 9 years ago
- Source for census.ire.org, including data processing scripts.☆139Updated 2 years ago
- A web service for disambiguating and canonically storing entities.☆25Updated 5 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 8 years ago
- Scripts to scrape the FEC website and parse campaign filings☆45Updated 12 years ago
- Parse the NYPD's weekly per-precinct crime complaints stats to CSV or MySQL☆24Updated 7 years ago
- a set of services that provide NLP facilities☆25Updated 3 years ago