jsoma / fuzzy_pandasLinks
Fuzzy matches and merging of datasets in pandas using csvmatch
☆74Updated 5 years ago
Alternatives and similar repositories for fuzzy_pandas
Users that are interested in fuzzy_pandas are comparing it to the libraries listed below
Sorting:
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆114Updated 6 months ago
- Fast, flexible name matching for large datasets☆72Updated 2 weeks ago
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.☆111Updated 6 months ago
- Dataset of state legislative elections from 1971–2018.☆45Updated 6 years ago
- A simple Python wrapper for U.S. Census Geocoding Services API batch service☆42Updated 6 months ago
- Python wrapper for the US Census Geocoder☆76Updated last month
- This repository includes data for snap analyses of the 2018 Midterm Elections using unofficial election returns data.☆49Updated 6 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆283Updated 2 years ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆20Updated 3 months ago
- a general list of resources and articles for people interested in getting into data journalism☆16Updated 2 years ago
- Text and statistics utilities from Pew Research Center☆84Updated 3 years ago
- Walk through making basic charts — and a choropleth map — with this Altair tutorial.☆22Updated 2 years ago
- Incarceration Trends Dataset and Documentation☆93Updated this week
- 150,000 tweets from 2016's second presdential debate between Hillary Clinton and Donald Trump☆10Updated 8 years ago
- Python 3 version of Quantipy☆39Updated 2 years ago
- Automated coding using machine-learning and remapping the U.S. nonprofit sector: A guide and benchmark☆24Updated last year
- Daily refreshed data on representation certification and unfair labor cases from nlrb.gov☆19Updated last month
- Materials to reproduce findings in our stories, "Swinging the Vote?", and "To Gmail, Most Black Lives Matter Emails Are 'Promotions'"☆38Updated 11 months ago
- A set of jupyter notebooks demonstrating how to use the Media Cloud API.☆37Updated 3 weeks ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆72Updated 3 weeks ago
- Materials for a NICAR 2020 workshop on advanced Census data with Python☆17Updated 2 years ago
- This repository includes all the data analyses I carry out for my general exams reading, Spring 2015☆64Updated 10 years ago
- NICAR 2019 workshop on using Python and PDFplumber to extract text from PDFs☆12Updated 6 years ago
- The FBAdLibrarian is a simple tool that can pull ad data and collects images offered by Facebook’s Ad Library API.☆15Updated 2 years ago
- Add state and county fips codes to data☆42Updated 11 months ago
- Guess gender from first name in Python 2 and 3☆134Updated 2 weeks ago
- Get Census Data from the API for arbitrary areas☆46Updated last month
- Geocoding APIs repo for NICAR20 session☆14Updated 5 years ago
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak…☆70Updated 3 years ago
- Tracing policy ideas from think tanks and lobbyists through state legislative bills☆47Updated 8 years ago