sunlightlabs / name-cleaverLinks
Parser and standardizer for politician, individual and organization names.
☆129Updated 8 years ago
Alternatives and similar repositories for name-cleaver
Users that are interested in name-cleaver are comparing it to the libraries listed below
Sorting:
- A repository of journalist's lookup tables.☆107Updated 8 years ago
- An experiment to standardize individual donor names in campaign finance data using simple graph theory and machine learning.☆65Updated 12 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆24Updated 4 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 9 years ago
- A simple Python library/tool for pulling location information from unstructured text☆186Updated 14 years ago
- Command-line tool for exploring the PAC donor-recipient relationship☆55Updated 10 years ago
- Tools for text tokenization and encoding☆84Updated 3 years ago
- Data Pipes for CSV☆116Updated 2 years ago
- A step-by-step guide to publishing a simple news application.☆75Updated 7 years ago
- A deprecated Python wrapper for the DocumentCloud API☆62Updated 4 years ago
- Collecting reports from Inspectors General across the US federal government.☆110Updated 4 years ago
- Ask questions about government data.☆38Updated 6 years ago
- Alpha-quality parser for Office of Government ethics form 278 public financial disclosure PDFs☆26Updated 3 years ago
- a python parser for the .fec file format☆46Updated 4 months ago
- ☆23Updated 10 years ago
- Scrapers for US municipal governments.☆105Updated last year
- Source for census.ire.org, including data processing scripts.☆140Updated 3 years ago
- Schemas to convert common fixed-width file formats into CSV using in2csv.☆125Updated 4 years ago
- NICAR 2016 talk about PDFs!☆63Updated 9 years ago
- Code for Newslynx App☆22Updated 9 years ago
- Front-end for the MediaCloud database☆16Updated 7 years ago
- A Los Angeles Times analysis of serious assaults misclassified by LAPD☆62Updated 6 years ago
- ☆36Updated 8 years ago
- Tracking changes to the official U.S. House and Senate roll call votes XML data files. Monitored hourly-ish by @GovTrack/@JoshData.☆33Updated 6 years ago
- 🔎 Finds fuzzy matches between CSV files☆191Updated 5 months ago
- CFPB's streaming batch geocoder☆36Updated 9 years ago
- “Let Me Get That Data For You” catalogs the machine-readable data on a given domain name. [RETIRED]☆102Updated 10 years ago
- ☆20Updated 8 years ago
- Scraping, parsing and indexing the daily Congressional Record to support phrase search over time, and by legislator and date☆123Updated 3 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago