Super Fast String Matching in Python
☆369 · Mar 14, 2025 · Updated last year
Alternatives and similar repositories for string_grouper
Users that are interested in string_grouper are comparing it to the libraries listed below.
- Python package to accelerate the sparse matrix multiplication and top-n similarity selection ☆422 · Apr 9, 2026 · Updated last month
- Fuzzy string matching, grouping, and evaluation. ☆794 · Jul 10, 2025 · Updated 10 months ago
- Python wrapper for a C++ Double Metaphone ☆15 · Jan 12, 2026 · Updated 3 months ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆86 · Oct 6, 2022 · Updated 3 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python ☆1,049 · Feb 21, 2024 · Updated 2 years ago
- ☆43 · Apr 20, 2023 · Updated 3 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4 ☆286 · Aug 9, 2022 · Updated 3 years ago
- Rapid fuzzy string matching in Python using various string metrics ☆3,888 · Updated this week
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆926 · Sep 2, 2024 · Updated last year
- Company Name Processor written in Python ☆354 · Jan 16, 2026 · Updated 3 months ago
- Generate reports for spaCy models. ☆29 · May 27, 2022 · Updated 3 years ago
- Sentence transformers models for SpaCy ☆108 · Mar 9, 2023 · Updated 3 years ago
- A browser user interface for manual labeling of record pairs. ☆48 · Jun 23, 2023 · Updated 2 years ago
- Match Patent Assignees with Compustat and SDC via Bing Search ☆54 · Sep 29, 2020 · Updated 5 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution. ☆4,461 · Jul 29, 2025 · Updated 9 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching ☆151 · Oct 16, 2024 · Updated last year
- ☆13 · Sep 2, 2021 · Updated 4 years ago
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends ☆2,134 · Updated this week
- A Cython implementation of the affine gap string distance ☆57 · Jan 23, 2023 · Updated 3 years ago
- source{d} MLonCode foundation - core algorithms and models. ☆13 · Oct 17, 2019 · Updated 6 years ago
- Extra blocks for scikit-learn pipelines. ☆1,391 · Updated this week
- Python package for performing Entity and Text Matching using Deep Learning. ☆615 · Jun 18, 2024 · Updated last year
- Media search's code ☆14 · Sep 15, 2018 · Updated 7 years ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. ☆3,528 · Apr 18, 2025 · Updated last year
- Text preprocessing, representation and visualization from zero to hero. ☆2,910 · Aug 29, 2023 · Updated 2 years ago
- Concurrent (with OLC) Adaptive Radix Trie in Golang. ☆12 · Jul 31, 2020 · Updated 5 years ago
- 🪼 a python library for doing approximate and phonetic matching of strings. ☆2,210 · Apr 7, 2026 · Updated last month
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆17 · Dec 20, 2021 · Updated 4 years ago
- just a bunch of useful embeddings for scikit-learn pipelines ☆525 · Feb 12, 2026 · Updated 2 months ago
- Simplifies use of the Dedupe library via Pandas ☆135 · Mar 30, 2023 · Updated 3 years ago
- skimpy is a light weight tool that provides summary statistics about variables in data frames within the console. ☆509 · Apr 30, 2026 · Updated last week
- Example of configuring multipage apps via a custom config file ☆18 · Nov 14, 2023 · Updated 2 years ago
- A Python library for calculating a large variety of metrics from text ☆363 · Mar 20, 2026 · Updated last month
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ ☆85 · Feb 1, 2026 · Updated 3 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction ☆2,212 · Apr 22, 2026 · Updated 2 weeks ago
- Simply, faster, sentence-transformers ☆144 · Aug 27, 2024 · Updated last year
- Implementation of Nested Named Entity Recognition using Flair ☆24 · Oct 29, 2021 · Updated 4 years ago
- A Python library for generating word tree diagrams ☆28 · Jul 10, 2020 · Updated 5 years ago
- Group thousands of similar spreadsheet or database text entries in seconds ☆158 · Jun 12, 2023 · Updated 2 years ago