jsoma / fuzzy_pandasLinks
Fuzzy matches and merging of datasets in pandas using csvmatch
☆76Updated 5 years ago
Alternatives and similar repositories for fuzzy_pandas
Users that are interested in fuzzy_pandas are comparing it to the libraries listed below
Sorting:
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.☆112Updated last year
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆286Updated 3 years ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆118Updated 3 weeks ago
- Python wrapper for the US Census Geocoder☆78Updated 7 months ago
- Quickly adjust U.S. dollars for inflation using the Consumer Price Index (CPI)☆137Updated last week
- A light-weight wrapper for the Datawrapper API.☆83Updated this week
- Tableau scraper python library. R and Python scripts to scrape data from Tableau viz☆135Updated last year
- Download U.S. census data and reformat it for humans☆225Updated last year
- Fast, flexible name matching for large datasets☆72Updated 2 months ago
- Get Census Data from the API for arbitrary areas☆46Updated 7 months ago
- Text and statistics utilities from Pew Research Center☆86Updated 3 years ago
- Dataset of state legislative elections from 1971–2018.☆46Updated 6 years ago
- A simple Python wrapper for U.S. Census Geocoding Services API batch service☆42Updated 11 months ago
- 🔎 Finds fuzzy matches between CSV files☆190Updated 7 months ago
- a general list of resources and articles for people interested in getting into data journalism☆16Updated 2 years ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆24Updated 8 months ago
- Materials for a NICAR 2020 workshop on advanced Census data with Python☆17Updated 2 years ago
- Walk through making basic charts — and a choropleth map — with this Altair tutorial.☆22Updated 3 years ago
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak…☆71Updated 4 years ago
- Tracing policy ideas from think tanks and lobbyists through state legislative bills☆47Updated 9 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆157Updated 2 years ago
- A set of jupyter notebooks demonstrating how to use the Media Cloud API.☆38Updated 4 months ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆77Updated 2 weeks ago
- How Quartz used AI to help reporters search the Mauritius Leaks☆47Updated 6 years ago
- Download IPEDS complete data files☆38Updated 7 years ago
- Command-line interface for downloading WARN Act notices of qualified plant closings and mass layoffs from state government websites☆33Updated last week
- Automated coding using machine-learning and remapping the U.S. nonprofit sector: A guide and benchmark☆24Updated 3 months ago
- List of data journalism courses and programmes from universities and higher education institutions around the world☆73Updated 2 years ago
- Standardized data on historical general election polling places in the United States.☆73Updated 3 years ago
- Workbook to teach the concept of risk ratios for data journalism applications☆33Updated 3 years ago