dedupeio / fuzzycategory
Fuzzy Categorical Distances
☆14 Updated 5 years ago
Alternatives and similar repositories for fuzzycategory
Users who are interested in fuzzycategory are comparing it to the libraries listed below.
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- A maximum-strength name parser for record linkage. ☆37 Updated last month
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea ☆13 Updated 8 years ago
- Python wrapper for a C++ Double Metaphone ☆15 Updated 3 weeks ago
- A browser user interface for manual labeling of record pairs. ☆47 Updated last year
- An index data structure for approximate string search. ☆23 Updated 6 years ago
- Dedupe/batch geocode addresses and venues around the world with libpostal ☆83 Updated 3 years ago
- (Archived) A Python library for record linkage and deduplication. ☆19 Updated last year
- Provide partial dates and retain the date precision through processing ☆13 Updated 2 years ago
- Algorithms for "schema matching" ☆26 Updated 8 years ago
- Hidden alignment conditional random field for classifying string pairs. ☆24 Updated 3 weeks ago
- Python port for IWNLP.Lemmatizer ☆17 Updated last year
- Streaming newline delimited JSON I/O. ☆12 Updated last year
- Parser and standardizer for politician, individual and organization names. ☆129 Updated 8 years ago
- Inspect a URL and estimate if it contains a news story ☆39 Updated 6 months ago
- ☆52 Updated last year
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors. ☆52 Updated 4 years ago
- Scalable String Similarity Joins in Python ☆39 Updated 10 months ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+ ☆48 Updated 2 years ago
- Generate SQL tables, load and extract data, based on JSON Table Schema descriptors. ☆62 Updated last year
- ☆13 Updated 6 years ago
- Execute OpenRefine JSON scripts without OpenRefine (or Java) ☆30 Updated 2 years ago
- A Cython implementation of the affine gap string distance ☆57 Updated 2 years ago
- Graph extraction and NLP analysis for Baleen Corpora ☆18 Updated 8 years ago
- Utilities for working with data. ☆20 Updated 10 years ago
- Using ML to extract campaign finance data from messy forms for journalism ☆76 Updated 2 years ago
- ☆30 Updated 2 years ago
- Python bindings to the Compact Language Detector ☆33 Updated 5 years ago
- a set of services that provide NLP facilities ☆25 Updated 4 years ago
- Extract, parse and populate templates from strings ☆27 Updated 6 years ago