dedupeio / doublemetaphoneLinks
Python wrapper for a C++ Double Metaphone
☆15Updated last month
Alternatives and similar repositories for doublemetaphone
Users that are interested in doublemetaphone are comparing it to the libraries listed below
Sorting:
- A maximum-strength name parser for record linkage.☆39Updated 4 months ago
- A browser user interface for manual labeling of record pairs.☆48Updated 2 years ago
- A simple command line interface to the datamade/dedupe library.☆43Updated 3 years ago
- Comparing Polars to Pandas and a small introduction☆44Updated 4 years ago
- Scalable String Similarity Joins in Python☆39Updated last year
- A Jupyter Lab extension for rendering tabular data☆35Updated 7 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆15Updated 6 years ago
- Python for people data☆71Updated last year
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆118Updated last month
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 4 years ago
- A `select` accessor for easier subsetting of pandas DataFrames and Series☆34Updated 2 years ago
- Creating user interfaces for data science with Jupyter widgets☆11Updated 8 years ago
- A simple library for adding noise to data.☆12Updated 6 years ago
- data wrangling simplicity, complete audit transparency, and at speed☆35Updated 3 months ago
- CSV inspection☆10Updated 3 years ago
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.☆113Updated last year
- Collection of code snippets and utilities for streamlit apps☆22Updated 5 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆57Updated 4 years ago
- this repo contains the draft, images, and code for the Medium blog post on altair themes.☆12Updated 7 years ago
- Collaboration app for sharing and reviewing jupyter notebooks☆16Updated 7 months ago
- A selection of statistical graphics for vega in python, based on altair.☆103Updated 2 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 4 years ago
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- A xlsx and html rendering library for rendering data available in Pandas DataFrames.☆25Updated last year
- Marshmallow Schema generator for Pandas DataFrames☆24Updated 5 years ago
- A friendly pandas wrapper with a more composable grammar support.☆14Updated 8 years ago
- Fast, flexible name matching for large datasets☆71Updated 4 months ago
- ☆32Updated 8 years ago
- Inspired by John Foreman. Created by the crowds.☆54Updated 2 years ago