dedupeio / doublemetaphoneLinks
Python wrapper for a C++ Double Metaphone
☆15Updated this week
Alternatives and similar repositories for doublemetaphone
Users that are interested in doublemetaphone are comparing it to the libraries listed below
Sorting:
- A maximum-strength name parser for record linkage.☆38Updated 3 weeks ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 6 years ago
- Collection of code snippets and utilities for streamlit apps☆22Updated 5 years ago
- A browser user interface for manual labeling of record pairs.☆47Updated 2 years ago
- Comparing Polars to Pandas and a small introduction☆44Updated 4 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Scalable String Similarity Joins in Python☆39Updated last year
- IPython Magic for exporting pandas objects to Excel☆13Updated 7 years ago
- ☆30Updated 3 years ago
- Twitter Discovery: Search articles referenced in your tweets, retweets, and favorites☆16Updated 5 years ago
- Creating user interfaces for data science with Jupyter widgets☆11Updated 7 years ago
- The Path of the PyData Ninja☆16Updated 10 years ago
- Inspired by John Foreman. Created by the crowds.☆54Updated last year
- Python for people data☆69Updated last year
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 4 years ago
- A `select` accessor for easier subsetting of pandas DataFrames and Series☆34Updated 2 years ago
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.☆112Updated 10 months ago
- Markdown template for Dataseets for Datasets☆63Updated 3 years ago
- Predict age and gender from a first name☆59Updated 7 years ago
- A selection of statistical graphics for vega in python, based on altair.☆102Updated last year
- Data exploration library with a pandas-like API☆74Updated 5 years ago
- this repo contains the draft, images, and code for the Medium blog post on altair themes.☆12Updated 6 years ago
- CSV inspection☆10Updated 2 years ago
- Fast, flexible name matching for large datasets☆72Updated last month
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 4 years ago
- Using ML to extract campaign finance data from messy forms for journalism☆77Updated 3 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course.☆23Updated 10 years ago
- ☆27Updated 6 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆157Updated 2 years ago