dedupeio / doublemetaphone
Python wrapper for a C++ Double Metaphone
☆15 Updated 2 weeks ago
Alternatives and similar repositories for doublemetaphone
Users interested in doublemetaphone are comparing it to the libraries listed below.
- A maximum-strength name parser for record linkage. ☆39 Updated 2 months ago
- A browser user interface for manual labeling of record pairs. ☆48 Updated 2 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆15 Updated 6 years ago
- Comparing Polars to Pandas and a small introduction ☆44 Updated 4 years ago
- Creating user interfaces for data science with Jupyter widgets ☆11 Updated 8 years ago
- A `select` accessor for easier subsetting of pandas DataFrames and Series ☆34 Updated 2 years ago
- Python for people data ☆69 Updated last year
- Scalable String Similarity Joins in Python ☆39 Updated last year
- The Path of the PyData Ninja ☆16 Updated 10 years ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors. ☆52 Updated 4 years ago
- A selection of statistical graphics for vega in python, based on altair. ☆102 Updated 2 years ago
- This repo contains the draft, images, and code for the Medium blog post on altair themes. ☆12 Updated 7 years ago
- CSV inspection ☆10 Updated 2 years ago
- Collection of code snippets and utilities for streamlit apps ☆22 Updated 5 years ago
- Render reproducible examples of Python code for posting to GitHub or Stack Overflow (port of R package reprex) ☆90 Updated 2 weeks ago
- Predict age and gender from a first name ☆59 Updated 7 years ago
- A simple library for adding noise to data. ☆12 Updated 6 years ago
- A friendly pandas wrapper with more composable grammar support. ☆14 Updated 8 years ago
- A Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary ☆15 Updated 5 years ago
- Set-oriented Operations in Pandas ☆24 Updated 5 years ago
- Data exploration library with a pandas-like API ☆74 Updated 5 years ago
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more. ☆112 Updated last year
- ☆31 Updated 9 years ago
- Notebooks which will provide a demo of Qgrid functionality ☆20 Updated 5 years ago
- Collaboration app for sharing and reviewing jupyter notebooks ☆16 Updated 6 months ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr… ☆57 Updated 4 years ago
- Construct, deconstruct, convert, execute, and prepare slides from Jupyter notebooks ☆35 Updated 6 months ago
- Ensemble topic modeling with matrix factorization ☆24 Updated 7 years ago
- Open Source Proxy Demographic module written in Python ☆35 Updated last year