Super Fast String Matching in Python
☆371 · Mar 14, 2025 · Updated 11 months ago
Alternatives and similar repositories for string_grouper
Users who are interested in string_grouper are comparing it to the libraries listed below.
- Python package to accelerate the sparse matrix multiplication and top-n similarity selection ☆420 · Jan 12, 2026 · Updated last month
- Fuzzy string matching, grouping, and evaluation. ☆791 · Jul 10, 2025 · Updated 7 months ago
- Python wrapper for a C++ Double Metaphone ☆15 · Jan 12, 2026 · Updated last month
- A browser user interface for manual labeling of record pairs. ☆48 · Jun 23, 2023 · Updated 2 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python ☆1,046 · Feb 21, 2024 · Updated 2 years ago
- Generate reports for spaCy models. ☆29 · May 27, 2022 · Updated 3 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4 ☆286 · Aug 9, 2022 · Updated 3 years ago
- Rapid fuzzy string matching in Python using various string metrics ☆3,740 · Jan 26, 2026 · Updated last month
- ☆43 · Apr 20, 2023 · Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆926 · Sep 2, 2024 · Updated last year
- A Flexible Deep Learning Approach to Fuzzy String Matching ☆150 · Oct 16, 2024 · Updated last year
- A Python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution. ☆4,438 · Jul 29, 2025 · Updated 7 months ago
- Dash Component created from ukrbublik/react-awesome-query-builder ☆12 · Feb 23, 2026 · Updated last week
- Extra blocks for scikit-learn pipelines. ☆1,382 · Updated this week
- Python package for performing Entity and Text Matching using Deep Learning. ☆614 · Jun 18, 2024 · Updated last year
- Company Name Processor written in Python ☆351 · Jan 16, 2026 · Updated last month
- A Simple Bulk Labelling Tool ☆599 · Jul 29, 2025 · Updated 7 months ago
- Concurrent (with OLC) Adaptive Radix Trie in Golang. ☆11 · Jul 31, 2020 · Updated 5 years ago
- Vector search in Lucene based search attempting to use just the existing Lucene data structures (experimental) ☆43 · Oct 29, 2019 · Updated 6 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ ☆85 · Feb 1, 2026 · Updated last month
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends ☆1,980 · Updated this week
- Text preprocessing, representation and visualization from zero to hero. ☆2,915 · Aug 29, 2023 · Updated 2 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆17 · Dec 20, 2021 · Updated 4 years ago
- Example of configuring multipage apps via a custom config file ☆18 · Nov 14, 2023 · Updated 2 years ago
- Media search's code ☆15 · Sep 15, 2018 · Updated 7 years ago
- ☆10 · Jul 28, 2022 · Updated 3 years ago
- Sentence transformers models for SpaCy ☆108 · Mar 9, 2023 · Updated 2 years ago
- Implementation of Nested Named Entity Recognition using Flair ☆24 · Oct 29, 2021 · Updated 4 years ago
- just a bunch of useful embeddings for scikit-learn pipelines ☆522 · Feb 12, 2026 · Updated 2 weeks ago
- NER, syntax markup visualizations ☆140 · Feb 9, 2026 · Updated 3 weeks ago
- A Python library for calculating a large variety of metrics from text ☆360 · Jan 30, 2026 · Updated last month
- Simple and efficient access to genomic data for deep learning models. ☆42 · Jan 9, 2020 · Updated 6 years ago
- source{d} MLonCode foundation - core algorithms and models. ☆14 · Oct 17, 2019 · Updated 6 years ago
- skimpy is a lightweight tool that provides summary statistics about variables in data frames within the console. ☆506 · Feb 24, 2026 · Updated last week
- 📊 A working, maintained copy of ggpy ☆31 · Jul 11, 2019 · Updated 6 years ago
- This repository highlights the workflow and ease of use of training machine learning or deep learning models using Azure Databricks. Then… ☆33 · Feb 1, 2024 · Updated 2 years ago
- A Cython implementation of the affine gap string distance ☆57 · Jan 23, 2023 · Updated 3 years ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. ☆3,517 · Apr 18, 2025 · Updated 10 months ago
- Rust UMI Directional Adjacency Deduplicator ☆15 · Nov 25, 2019 · Updated 6 years ago