dedupeio / fuzzycategory
Fuzzy Categorical Distances
☆14Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for fuzzycategory
- An index data structure for approximate string search.☆23Updated 5 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping☆17Updated 9 years ago
- A maximum-strength name parser for record linkage.☆32Updated 3 months ago
- Dedupe/batch geocode addresses and venues around the world with libpostal☆82Updated 2 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago
- Python library to infer date format from examples☆42Updated 2 years ago
- Algorithms for "schema matching"☆25Updated 8 years ago
- Hidden alignment conditional random field for classifying string pairs.☆25Updated last month
- A Cython implementation of the affine gap string distance☆58Updated last year
- Fuzzy matching for companies'names☆10Updated 5 years ago
- Python binding for gumbo-parser using Cython☆14Updated 8 years ago
- Scalable String Similarity Joins in Python☆39Updated 4 months ago
- Python wrapper for a C++ Double Metaphone☆15Updated last year
- Enhance your feature engineering workflow with Kodiak☆20Updated last year
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 8 years ago
- Provide partial dates and retain the date precision through processing☆13Updated last year
- CSV on the web☆37Updated 2 weeks ago
- Sentiment analysis made easy; built on top off solid libraries.☆24Updated 7 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- mltk - Moz Language Tool Kit☆12Updated 9 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆34Updated 8 years ago
- SQLAlchemy models and DDL and ERD generation from chop-dbhi/data-models style JSON endpoints.☆11Updated last year
- Pandas-SQLAlchemy integration☆28Updated 8 months ago
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆32Updated 5 years ago
- Predict age and gender from a first name☆60Updated 6 years ago