jsoma / fuzzy_pandas
Fuzzy matches and merging of datasets in pandas using csvmatch
☆74Updated 4 years ago
Alternatives and similar repositories for fuzzy_pandas:
Users that are interested in fuzzy_pandas are comparing it to the libraries listed below
- Python wrapper for the US Census Geocoder☆74Updated 8 months ago
- Fast, flexible name matching for large datasets☆70Updated last year
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.☆106Updated 2 months ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆109Updated last month
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆282Updated 2 years ago
- A simple Python wrapper for U.S. Census Geocoding Services API batch service☆42Updated 2 months ago
- A set of jupyter notebooks demonstrating how to use the Media Cloud API.☆35Updated last year
- Dataset of state legislative elections from 1971–2018.☆45Updated 5 years ago
- Text and statistics utilities from Pew Research Center☆82Updated 2 years ago
- Get Census Data from the API for arbitrary areas☆44Updated 4 months ago
- This repository includes data for snap analyses of the 2018 Midterm Elections using unofficial election returns data.☆49Updated 6 years ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆72Updated last month
- Materials for a NICAR 2020 workshop on advanced Census data with Python☆17Updated last year
- Command-line interface for downloading WARN Act notices of qualified plant closings and mass layoffs from state government websites☆29Updated this week
- ☆21Updated last year
- Download IPEDS complete data files☆39Updated 6 years ago
- Loads raw FEC filings into a database☆22Updated 2 years ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆17Updated last month
- Open Source Proxy Demographic module written in Python☆32Updated 8 months ago
- General programming utilities from Pew Research Center☆69Updated 2 years ago
- Workbook to teach the concept of risk ratios for data journalism applications☆32Updated 2 years ago
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence☆62Updated last year
- Ecological inference, in Python☆29Updated last week
- a general list of resources and articles for people interested in getting into data journalism☆16Updated last year
- Incarceration Trends Dataset and Documentation☆91Updated 2 months ago
- Standardized data on historical general election polling places in the United States.☆72Updated 3 years ago
- Generates a long-form version of every field in the IRS 990 e-file dataset based on the NOPDC "Datathon" concordance☆33Updated 6 years ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on scraping web data using Python.☆19Updated last year
- Download data from Census API☆141Updated 2 years ago
- IRSx: Turn the IRS' versioned XML 990 nonprofit annual tax returns into standardized python objects, json, or human readable text with or…☆119Updated 7 months ago