dedupeio / doublemetaphone
Python wrapper for a C++ Double Metaphone
☆15 · Updated 2 years ago
Alternatives and similar repositories for doublemetaphone:
Users who are interested in doublemetaphone are comparing it to the libraries listed below.
- ☆13 · Updated 6 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 · Updated 6 years ago
- A maximum-strength name parser for record linkage. ☆37 · Updated this week
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models ☆16 · Updated last month
- A browser user interface for manual labeling of record pairs. ☆47 · Updated last year
- Statistical visualizations for Datasette using Seaborn ☆12 · Updated 3 years ago
- Comparing Polars to Pandas and a small introduction ☆43 · Updated 3 years ago
- The Path of the PyData Ninja ☆16 · Updated 9 years ago
- Slideshow template for Voilà based on RevealJS ☆16 · Updated 3 years ago
- motivational website to do something special this month ☆21 · Updated last year
- Binary Python bindings for poppler utils for content extraction ☆42 · Updated 3 years ago
- A python module that will check for package updates. ☆28 · Updated 3 years ago
- A friendly pandas wrapper with a more composable grammar support. ☆14 · Updated 8 years ago
- Creating user interfaces for data science with Jupyter widgets ☆11 · Updated 7 years ago
- Get Artist Concerts History from setlist.fm website ☆11 · Updated 2 years ago
- Collaboration app for sharing and reviewing jupyter notebooks ☆16 · Updated 2 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 · Updated 2 years ago
- A tool to allow US addresses to be geocoded/georeferenced easily, without using Python or the command line or paid services or anything. ☆18 · Updated 2 years ago
- CSV inspection ☆10 · Updated 2 years ago
- Set-oriented Operations in Pandas ☆24 · Updated 4 years ago
- Data Scientist code test ☆19 · Updated 4 years ago
- Dask tutorial for PyData DC 2016 ☆11 · Updated 8 years ago
- IPython Magic for exporting pandas objects to Excel ☆13 · Updated 7 years ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors. ☆52 · Updated 3 years ago
- this repo contains the draft, images, and code for the Medium blog post on altair themes. ☆12 · Updated 6 years ago
- data wrangling simplicity, complete audit transparency, and at speed ☆34 · Updated last month
- Notebooks which will provide a demo of Qgrid functionality ☆20 · Updated 5 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/) ☆11 · Updated 10 years ago
- A command line utility to create kernels in Jupyter from virtual environments. ☆16 · Updated 7 years ago
- A simple library for adding noise to data. ☆12 · Updated 6 years ago