dedupeio / doublemetaphone
Python wrapper for a C++ Double Metaphone
☆15Updated 2 years ago
Alternatives and similar repositories for doublemetaphone:
Users who are interested in doublemetaphone are comparing it to the libraries listed below.
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 6 years ago
- A maximum-strength name parser for record linkage.☆36Updated last month
- ☆13Updated 5 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Creating user interfaces for data science with Jupyter widgets☆11Updated 7 years ago
- Slideshow template for Voilà based on RevealJS☆16Updated 3 years ago
- Python and pandas tools to perform various analyses on different types of word lists☆16Updated 10 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance☆12Updated 8 years ago
- A browser user interface for manual labeling of record pairs.☆45Updated last year
- Sidewall is a Python library for interacting with the Dimensions search API.☆17Updated 6 months ago
- A tool to allow US addresses to be geocoded/georeferenced easily, without using Python or the command line or paid services or anything.☆17Updated 2 years ago
- Scalable String Similarity Joins in Python☆38Updated 8 months ago
- Comparing Polars to Pandas and a small introduction☆43Updated 3 years ago
- Python package used to convert Jupyter Notebooks into Jekyll-ready documents, including validation and version control tagging☆21Updated 6 years ago
- A Python module that will check for package updates.☆28Updated 3 years ago
- A friendly pandas wrapper with more composable grammar support.☆14Updated 8 years ago
- This repo contains the draft, images, and code for the Medium blog post on Altair themes.☆12Updated 6 years ago
- The Path of the PyData Ninja☆16Updated 9 years ago
- Write Datasette canned queries as plain SQL files☆13Updated 2 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- A `select` accessor for easier subsetting of pandas DataFrames and Series☆34Updated last year
- Notebooks which will provide a demo of Qgrid functionality☆20Updated 5 years ago
- Python language parser for a tabular format for structured metadata. http://metatab.org☆17Updated last year
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- Dexter document monitor for MMA☆16Updated 10 months ago
- Predict age and gender from a first name☆60Updated 6 years ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 3 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets☆14Updated 4 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- ☆13Updated 8 years ago