dedupeio / fuzzycategory
Fuzzy Categorical Distances
☆14Updated 5 years ago
Alternatives and similar repositories for fuzzycategory:
Users that are interested in fuzzycategory are comparing it to the libraries listed below
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- A maximum-strength name parser for record linkage.☆36Updated last week
- Python binding for gumbo-parser using Cython☆14Updated 8 years ago
- An index data structure for approximate string search.☆23Updated 5 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 6 months ago
- Scalable String Similarity Joins in Python☆39Updated 8 months ago
- Enhance your feature engineering workflow with Kodiak☆19Updated last year
- ☆13Updated 5 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago
- Dedupe/batch geocode addresses and venues around the world with libpostal☆83Updated 3 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆149Updated 2 months ago
- ☆19Updated 6 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- Inspect a URL and estimate if it contains a news story☆39Updated 4 months ago
- Python wrapper for a C++ Double Metaphone☆15Updated 2 years ago
- (Archived) A Python library for record linkage and deduplication.☆19Updated last year
- Utilities for working with data.☆20Updated 10 years ago
- Sentiment analysis made easy; built on top off solid libraries.☆24Updated 8 years ago
- A browser user interface for manual labeling of record pairs.☆46Updated last year
- Algorithms for "schema matching"☆26Updated 8 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping☆17Updated 9 years ago
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆32Updated 5 years ago
- a set of services that provide NLP facilities☆25Updated 4 years ago
- Streaming newline delimited JSON I/O.☆12Updated last year
- ☆30Updated 2 years ago
- ☆21Updated 6 years ago
- A trend viewer written in Python/JavaScript☆21Updated 4 months ago
- View a list of JSON-serializable dictionaries or a 2-D array, in HandsOnTable, in Jupyter Notebook.☆13Updated 6 years ago