sunlightlabs / name-cleaverLinks
Parser and standardizer for politician, individual and organization names.
☆128Updated 8 years ago
Alternatives and similar repositories for name-cleaver
Users that are interested in name-cleaver are comparing it to the libraries listed below
Sorting:
- A repository of journalist's lookup tables.☆107Updated 8 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆24Updated 5 years ago
- Front-end for the MediaCloud database☆16Updated 7 years ago
- A deprecated Python wrapper for the DocumentCloud API☆62Updated 5 years ago
- A simple Python library/tool for pulling location information from unstructured text☆186Updated 15 years ago
- An experiment to standardize individual donor names in campaign finance data using simple graph theory and machine learning.☆65Updated 13 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 10 years ago
- legacy backend for Open States☆87Updated 6 years ago
- Tools for text tokenization and encoding☆84Updated 4 years ago
- Scrapers for US municipal governments.☆104Updated 2 months ago
- Source for census.ire.org, including data processing scripts.☆140Updated 3 years ago
- Command-line tool for exploring the PAC donor-recipient relationship☆55Updated 11 years ago
- Collecting reports from Inspectors General across the US federal government.☆112Updated 5 years ago
- Twitter, quick. Fetch and store tweets on short notice.☆79Updated 9 years ago
- A simple command line interface to the datamade/dedupe library.☆43Updated 3 years ago
- Scraping, parsing and indexing the daily Congressional Record to support phrase search over time, and by legislator and date☆122Updated 3 years ago
- Ask questions about government data.☆38Updated 7 years ago
- framework for scraping legislative/government data☆89Updated 2 months ago
- ☆23Updated 10 years ago
- Schemas to convert common fixed-width file formats into CSV using in2csv.☆125Updated 4 years ago
- NICAR 2016 talk about PDFs!☆63Updated 9 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- PANDA: A Newsroom Data Appliance☆208Updated 3 years ago
- 🔎 Finds fuzzy matches between CSV files☆191Updated 10 months ago
- Command line tool for deduplicating CSV files☆432Updated 5 years ago
- a python parser for the .fec file format☆46Updated 9 months ago
- Python scripts to parse U.S. voter files☆122Updated 4 years ago
- Code for Newslynx App☆22Updated 10 years ago
- Tracking changes to the official U.S. House and Senate roll call votes XML data files. Monitored hourly-ish by @GovTrack/@JoshData.☆33Updated 7 years ago
- The core of sunlightlabs' Data Commons project. Includes the Transparency Data site and the APIs that power TransparencyData.com and Infl…☆38Updated 9 years ago