sunlightlabs / name-cleaver
Parser and standardizer for politician, individual and organization names.
☆129 Updated 7 years ago
Alternatives and similar repositories for name-cleaver:
Users who are interested in name-cleaver are comparing it to the libraries listed below.
- A repository of journalist's lookup tables. ☆106 Updated 8 years ago
- An experiment to standardize individual donor names in campaign finance data using simple graph theory and machine learning. ☆65 Updated 12 years ago
- legacy backend for Open States ☆87 Updated 5 years ago
- a python parser for the .fec file format ☆45 Updated last week
- ☆8 Updated 8 years ago
- Scraping, parsing and indexing the daily Congressional Record to support phrase search over time, and by legislator and date ☆123 Updated 3 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future. ☆24 Updated 4 years ago
- A deprecated Python wrapper for the DocumentCloud API ☆62 Updated 4 years ago
- ☆36 Updated 7 years ago
- Command-line tool for exploring the PAC donor-recipient relationship ☆55 Updated 10 years ago
- Collecting reports from Inspectors General across the US federal government. ☆109 Updated 4 years ago
- Tracking changes to the official U.S. House and Senate roll call votes XML data files. Monitored hourly-ish by @GovTrack/@JoshData. ☆33 Updated 6 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- Tools for text tokenization and encoding ☆84 Updated 3 years ago
- Code for Newslynx App ☆22 Updated 9 years ago
- A Ruby gem that extracts press releases and statements by members of Congress. ☆70 Updated 9 years ago
- Data Pipes for CSV ☆116 Updated 2 years ago
- The core of sunlightlabs' Data Commons project. Includes the Transparency Data site and the APIs that power TransparencyData.com and Infl… ☆38 Updated 8 years ago
- Data for examining Uniform Crime Reporting data for 68 major cities. ☆41 Updated 8 years ago
- Dedupe/batch geocode addresses and venues around the world with libpostal ☆82 Updated 3 years ago
- pneumatic is a bulk-upload library for DocumentCloud. ☆22 Updated 4 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆151 Updated 3 months ago
- yet another foia automation service ☆43 Updated 2 years ago
- A collection of introductions to various datasets, giving journalists some friendly background before they start doing analysis. Like "Hi… ☆71 Updated 10 years ago
- Scripts to scrape the FEC website and parse campaign filings ☆45 Updated 13 years ago
- Python client library for controlling Google Refine ☆83 Updated 7 years ago
- Interactive and searchable House staffer directory, based on House disbursement data. ☆27 Updated last year
- Unified Python bindings for Sunlight APIs ☆66 Updated 9 years ago
- ☆23 Updated 9 years ago
- Tools for tracking stories on news homepages ☆48 Updated 5 years ago