dedupeio / fuzzycategory
Fuzzy Categorical Distances
☆14 · Updated 5 years ago
Alternatives and similar repositories for fuzzycategory
Users who are interested in fuzzycategory are comparing it to the libraries listed below.
- A simple command line interface to the datamade/dedupe library. ☆42 · Updated 2 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea ☆13 · Updated 8 years ago
- An index data structure for approximate string search. ☆23 · Updated 6 years ago
- Python wrapper for a C++ Double Metaphone ☆15 · Updated last week
- Algorithms for "schema matching" ☆26 · Updated 8 years ago
- Hidden alignment conditional random field for classifying string pairs. ☆24 · Updated last week
- A maximum-strength name parser for record linkage. ☆37 · Updated last week
- Streaming newline delimited JSON I/O. ☆12 · Updated last year
- (Archived) A Python library for record linkage and deduplication. ☆19 · Updated last year
- Scalable String Similarity Joins in Python ☆39 · Updated 11 months ago
- Extract, parse and populate templates from strings ☆27 · Updated 6 years ago
- Python binding for gumbo-parser using Cython ☆14 · Updated 8 years ago
- A browser user interface for manual labeling of record pairs. ☆47 · Updated 2 years ago
- Using ML to extract campaign finance data from messy forms for journalism ☆76 · Updated 2 years ago
- Demo code for learning_text_transformer ☆25 · Updated 10 years ago
- ☆13 · Updated 6 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine ☆72 · Updated 8 years ago
- Python port for IWNLP.Lemmatizer ☆17 · Updated last year
- Python script for matching a list of messy addresses against a gazetteer using dedupe. ☆63 · Updated 5 years ago
- a set of services that provide NLP facilities ☆25 · Updated 4 years ago
- Just charts. Really. ☆22 · Updated last year
- Inspect a URL and estimate if it contains a news story ☆39 · Updated 7 months ago
- ☆30 · Updated 3 years ago
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema. ☆29 · Updated 6 months ago
- View a list of JSON-serializable dictionaries or a 2-D array, in HandsOnTable, in Jupyter Notebook. ☆13 · Updated 6 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping ☆17 · Updated 9 years ago
- A Python wrapper over the GraphGen system ☆37 · Updated 7 years ago
- Python library to infer date format from examples ☆43 · Updated 3 years ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors. ☆52 · Updated 4 years ago
- Force-Atlas 2 graph layout in networkx ☆22 · Updated 10 years ago