dedupeio / fuzzycategory
Fuzzy Categorical Distances
☆14Updated 5 years ago
Alternatives and similar repositories for fuzzycategory:
Users that are interested in fuzzycategory are comparing it to the libraries listed below
- A maximum-strength name parser for record linkage.☆37Updated this week
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Python library to infer date format from examples☆43Updated 3 years ago
- A browser user interface for manual labeling of record pairs.☆47Updated last year
- Dedupe/batch geocode addresses and venues around the world with libpostal☆82Updated 3 years ago
- Inspect a URL and estimate if it contains a news story☆39Updated 5 months ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 7 months ago
- Search 'from' and 'to' strings to learn a text cleaning mapping☆17Updated 9 years ago
- Python binding for gumbo-parser using Cython☆14Updated 8 years ago
- agate-sql adds SQL read/write support to agate.☆18Updated 2 months ago
- Scalable String Similarity Joins in Python☆39Updated 9 months ago
- Parser and standardizer for politician, individual and organization names.☆129Updated 7 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated 2 years ago
- Streaming newline delimited JSON I/O.☆12Updated last year
- Generate SQL tables, load and extract data, based on JSON Table Schema descriptors.☆62Updated last year
- Provide partial dates and retain the date precision through processing☆13Updated 2 years ago
- Functional Airflow DAG definitions.☆38Updated 7 years ago
- An index data structure for approximate string search.☆23Updated 6 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- ☆13Updated 6 years ago
- (Archived) A Python library for record linkage and deduplication.☆19Updated last year
- An advanced yet simple system to run your background tasks and workflows☆20Updated 7 years ago
- Postgresql utilities for ETL and data analysis☆24Updated 7 years ago
- Enhance your feature engineering workflow with Kodiak☆19Updated last year
- Predict age and gender from a first name☆60Updated 6 years ago
- Versioned domain model. Python library for revisioning/versioning of databases.☆44Updated 4 years ago
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆32Updated 6 years ago
- RESTful API around the PETRARCH coding software☆10Updated 4 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago