sunlightlabs / name-cleaver
Parser and standardizer for politician, individual and organization names.
☆129 · Updated 8 years ago
Alternatives and similar repositories for name-cleaver
Users who are interested in name-cleaver compare it to the libraries listed below.
- A repository of journalist's lookup tables. ☆106 · Updated 8 years ago
- Scraping, parsing and indexing the daily Congressional Record to support phrase search over time, and by legislator and date ☆123 · Updated 3 years ago
- An experiment to standardize individual donor names in campaign finance data using simple graph theory and machine learning. ☆65 · Updated 12 years ago
- A simple Python library/tool for pulling location information from unstructured text ☆186 · Updated 14 years ago
- legacy backend for Open States ☆87 · Updated 5 years ago
- Command-line tool for exploring the PAC donor-recipient relationship ☆55 · Updated 10 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future. ☆24 · Updated 4 years ago
- Tools for text tokenization and encoding ☆84 · Updated 3 years ago
- Investigative tool for extracting relevant areas from many documents ☆14 · Updated 9 years ago
- Collecting reports from Inspectors General across the US federal government. ☆109 · Updated 4 years ago
- Scrapers for US municipal governments. ☆102 · Updated last year
- a python parser for the .fec file format ☆46 · Updated 2 months ago
- Data Pipes for CSV ☆116 · Updated 2 years ago
- Front-end for the MediaCloud database ☆16 · Updated 7 years ago
- Code for Newslynx App ☆22 · Updated 9 years ago
- ☆8 · Updated 9 years ago
- 🔎 Finds fuzzy matches between CSV files ☆190 · Updated 3 months ago
- A simple command line interface to the datamade/dedupe library. ☆42 · Updated 2 years ago
- NICAR 2016 talk about PDFs! ☆62 · Updated 9 years ago
- A step-by-step guide to publishing a simple news application. ☆75 · Updated 7 years ago
- Unified Python bindings for Sunlight APIs ☆66 · Updated 9 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py ☆390 · Updated 2 years ago
- Scripts to scrape the FEC website and parse campaign filings ☆45 · Updated 13 years ago
- PANDA: A Newsroom Data Appliance ☆205 · Updated 3 years ago
- framework for scraping legislative/government data ☆86 · Updated 10 months ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics. ☆80 · Updated last year
- Schemas to convert common fixed-width file formats into CSV using in2csv. ☆124 · Updated 3 years ago
- Twitter, quick. Fetch and store tweets on short notice. ☆80 · Updated 8 years ago
- A deprecated Python wrapper for the DocumentCloud API ☆62 · Updated 4 years ago
- ☆23 · Updated 10 years ago