Super Fast String Matching in Python
☆370 · Jun 3, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for string_grouper
Users that are interested in string_grouper are comparing it to the libraries listed below.
- Python package to accelerate the sparse matrix multiplication and top-n similarity selection ☆423 · Updated this week
- Fuzzy string matching, grouping, and evaluation. ☆798 · Jul 10, 2025 · Updated 11 months ago
- Python wrapper for a C++ Double Metaphone ☆15 · Jan 12, 2026 · Updated 5 months ago
- ☆11 · Nov 17, 2017 · Updated 8 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆86 · Oct 6, 2022 · Updated 3 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python ☆1,051 · Feb 21, 2024 · Updated 2 years ago
- ☆43 · Apr 20, 2023 · Updated 3 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4 ☆286 · Aug 9, 2022 · Updated 3 years ago
- Rapid fuzzy string matching in Python using various string metrics ☆3,952 · Jun 8, 2026 · Updated last week
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆925 · Sep 2, 2024 · Updated last year
- Company Name Processor written in Python ☆355 · Jan 16, 2026 · Updated 5 months ago
- Generate reports for spaCy models. ☆29 · May 27, 2022 · Updated 4 years ago
- Sentence transformers models for SpaCy ☆108 · Mar 9, 2023 · Updated 3 years ago
- A browser user interface for manual labeling of record pairs. ☆48 · Jun 23, 2023 · Updated 2 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution. ☆4,479 · Jul 29, 2025 · Updated 10 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching ☆151 · Oct 16, 2024 · Updated last year
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends ☆2,202 · Jun 12, 2026 · Updated last week
- A Cython implementation of the affine gap string distance ☆57 · Jan 23, 2023 · Updated 3 years ago
- source{d} MLonCode foundation - core algorithms and models. ☆13 · Oct 17, 2019 · Updated 6 years ago
- Extra blocks for scikit-learn pipelines. ☆1,398 · Jun 12, 2026 · Updated last week
- Python package for performing Entity and Text Matching using Deep Learning. ☆620 · Jun 18, 2024 · Updated 2 years ago
- Media search's code ☆14 · Sep 15, 2018 · Updated 7 years ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. ☆3,533 · Apr 18, 2025 · Updated last year
- Estimating Body Fat Using Computer Vision (openCV2, Python) ☆23 · Dec 18, 2014 · Updated 11 years ago
- A Simple Bulk Labelling Tool ☆597 · Jul 29, 2025 · Updated 10 months ago
- Text preprocessing, representation and visualization from zero to hero. ☆2,912 · Aug 29, 2023 · Updated 2 years ago
- Concurrent (with OLC) Adaptive Radix Trie in Golang. ☆12 · Jul 31, 2020 · Updated 5 years ago
- 🪼 a python library for doing approximate and phonetic matching of strings. ☆2,219 · Apr 7, 2026 · Updated 2 months ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆17 · Dec 20, 2021 · Updated 4 years ago
- just a bunch of useful embeddings for scikit-learn pipelines ☆526 · Feb 12, 2026 · Updated 4 months ago
- Simplifies use of the Dedupe library via Pandas ☆135 · Mar 30, 2023 · Updated 3 years ago
- skimpy is a light weight tool that provides summary statistics about variables in data frames within the console. ☆512 · Jun 12, 2026 · Updated last week
- Example of configuring multipage apps via a custom config file ☆18 · Nov 14, 2023 · Updated 2 years ago
- A Python library for calculating a large variety of metrics from text ☆367 · May 5, 2026 · Updated last month
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ ☆85 · Feb 1, 2026 · Updated 4 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction ☆2,213 · Apr 22, 2026 · Updated last month
- Simply, faster, sentence-transformers ☆144 · Aug 27, 2024 · Updated last year
- Implementation of Nested Named Entity Recognition using Flair ☆24 · Oct 29, 2021 · Updated 4 years ago
- A Python library for generating word tree diagrams ☆28 · Jul 10, 2020 · Updated 5 years ago