dedupeio / doublemetaphone
Python wrapper for a C++ Double Metaphone
☆15 Updated 2 years ago
Alternatives and similar repositories for doublemetaphone:
Users who are interested in doublemetaphone are comparing it to the libraries listed below:
- A maximum-strength name parser for record linkage. ☆36 Updated last week
- ☆13 Updated 5 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 Updated 6 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models ☆16 Updated 2 weeks ago
- Notebooks which will provide a demo of Qgrid functionality ☆20 Updated 5 years ago
- Binary Python bindings for poppler utils for content extraction ☆42 Updated 3 years ago
- Comparing Polars to Pandas and a small introduction ☆43 Updated 3 years ago
- This repo contains the draft, images, and code for the Medium blog post on Altair themes. ☆12 Updated 6 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- Just charts. Really. ☆22 Updated last year
- Write Datasette canned queries as plain SQL files ☆13 Updated 2 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets ☆14 Updated 4 years ago
- CSV inspection ☆10 Updated 2 years ago
- A browser user interface for manual labeling of record pairs. ☆46 Updated last year
- A `select` accessor for easier subsetting of pandas DataFrames and Series ☆34 Updated last year
- A friendly pandas wrapper with more composable grammar support. ☆14 Updated 8 years ago
- Set-oriented Operations in Pandas ☆24 Updated 4 years ago
- data wrangling simplicity, complete audit transparency, and at speed ☆34 Updated 3 weeks ago
- Graph extraction and NLP analysis for Baleen Corpora ☆18 Updated 8 years ago
- Get Artist Concerts History from setlist.fm website ☆11 Updated 2 years ago
- Exploring sequential data with a sankey diagram ☆22 Updated last year
- Dexter document monitor for MMA ☆16 Updated 11 months ago
- Burglary prediction for mortals ☆10 Updated 10 months ago
- Construct, deconstruct, convert, execute, and prepare slides from Jupyter notebooks ☆34 Updated this week
- Slideshow template for Voilà based on RevealJS ☆16 Updated 3 years ago
- Tools for analyzing the Hillary Clinton emails ☆13 Updated 8 years ago
- Sidewall is a Python library for interacting with the Dimensions search API. ☆17 Updated 7 months ago
- The Path of the PyData Ninja ☆16 Updated 9 years ago
- Provide partial dates and retain the date precision through processing ☆13 Updated 2 years ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors. ☆52 Updated 3 years ago