Super Fast String Matching in Python
☆368Mar 14, 2025Updated last year
Alternatives and similar repositories for string_grouper
Users that are interested in string_grouper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python package to accelerate the sparse matrix multiplication and top-n similarity selection☆422Apr 9, 2026Updated last week
- Fuzzy string matching, grouping, and evaluation.☆794Jul 10, 2025Updated 9 months ago
- Python wrapper for a C++ Double Metaphone☆15Jan 12, 2026Updated 3 months ago
- ☆11Nov 17, 2017Updated 8 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆86Oct 6, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,046Feb 21, 2024Updated 2 years ago
- ☆43Apr 20, 2023Updated 2 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆286Aug 9, 2022Updated 3 years ago
- Rapid fuzzy string matching in Python using various string metrics☆3,850Apr 13, 2026Updated last week
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- Company Name Processor written in Python☆354Jan 16, 2026Updated 3 months ago
- Generate reports for spaCy models.☆29May 27, 2022Updated 3 years ago
- Sentence transformers models for SpaCy☆108Mar 9, 2023Updated 3 years ago
- A browser user interface for manual labeling of record pairs.☆48Jun 23, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,451Jul 29, 2025Updated 8 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆150Oct 16, 2024Updated last year
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends☆2,083Updated this week
- A Cython implementation of the affine gap string distance☆57Jan 23, 2023Updated 3 years ago
- Dash Component created from ukrbublik/react-awesome-query-builder☆12Updated this week
- Extra blocks for scikit-learn pipelines.☆1,390Updated this week
- Python package for performing Entity and Text Matching using Deep Learning.☆615Jun 18, 2024Updated last year
- Media search's code☆14Sep 15, 2018Updated 7 years ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,526Apr 18, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Simple Bulk Labelling Tool☆597Jul 29, 2025Updated 8 months ago
- Estimating Body Fat Using Computer Vision (openCV2, Python)☆22Dec 18, 2014Updated 11 years ago
- Text preprocessing, representation and visualization from zero to hero.☆2,909Aug 29, 2023Updated 2 years ago
- Concurrent (with OLC) Adaptive Radix Trie in Golang.☆11Jul 31, 2020Updated 5 years ago
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,207Apr 7, 2026Updated last week
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Dec 20, 2021Updated 4 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆525Feb 12, 2026Updated 2 months ago
- Simplifies use of the Dedupe library via Pandas☆135Mar 30, 2023Updated 3 years ago
- skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.☆508Updated this week
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Example of configuring multiplage apps via a custom config file☆18Nov 14, 2023Updated 2 years ago
- A Python library for calculating a large variety of metrics from text☆363Mar 20, 2026Updated 3 weeks ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Feb 1, 2026Updated 2 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,211Feb 15, 2026Updated 2 months ago
- Simply, faster, sentence-transformers☆144Aug 27, 2024Updated last year
- Implementation of Nested Named Entity Recognition using Flair☆24Oct 29, 2021Updated 4 years ago
- A Python library for generating word tree diagrams☆28Jul 10, 2020Updated 5 years ago