taleinat/fuzzysearch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/taleinat/fuzzysearch)

taleinat / fuzzysearch

Find parts of long text or data, allowing for some changes/typos.

☆342

Alternatives and similar repositories for fuzzysearch

Users that are interested in fuzzysearch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

seatgeek / fuzzywuzzy
View on GitHub
Fuzzy String Matching in Python
☆9,262Feb 24, 2023Updated 3 years ago
kmike / morphine
View on GitHub
[experiment] CRF-based disambiguation engine for pymorphy2
☆10May 9, 2016Updated 10 years ago
prohandler / GS-Bulk-Emails
View on GitHub
Google App Scripts that sends a number of emails from the specific number and that tracks the open status of each email
☆17Dec 11, 2024Updated last year
wolfgarbe / LinSpell
View on GitHub
Fast approximate strings search & spelling correction
☆61Oct 30, 2021Updated 4 years ago
deeppavlov / ru_sentence_tokenizer
View on GitHub
A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.
☆52Jul 4, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
fujimotos / TinyFastSS
View on GitHub
An index data structure for approximate string search.
☆23May 6, 2019Updated 7 years ago
Guangyi-Z / py-aho-corasick
View on GitHub
A pure Python implementation of Aho-Corasick algorithm.
☆23Jul 10, 2018Updated 8 years ago
rapidfuzz / RapidFuzz
View on GitHub
Rapid fuzzy string matching in Python using various string metrics
☆4,034Updated this week
WojciechMula / pyahocorasick
View on GitHub
Python module (C extension and plain python) implementing Aho-Corasick algorithm
☆1,116Apr 27, 2026Updated 2 months ago
zainhoda / orbgo
View on GitHub
Free and open source Tableau alternative that generates Python Pandas code
☆12Aug 23, 2018Updated 7 years ago
seatgeek / thefuzz
View on GitHub
Fuzzy String Matching in Python
☆3,645Mar 3, 2025Updated last year
KennethEnevoldsen / augmenty
View on GitHub
Augmenty is an augmentation library based on spaCy for augmenting texts.
☆156May 24, 2024Updated 2 years ago
hiroshi-manabe / CRFSegmenter
View on GitHub
A multi-language segmenter using high-order CRF.
☆17Feb 27, 2020Updated 6 years ago
shyamupa / xelms
View on GitHub
☆19Dec 19, 2018Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
TeamHG-Memex / tor-proxy
View on GitHub
a tor socks proxy docker image
☆12Apr 8, 2026Updated 3 months ago
INK-USC / AlpacaTag
View on GitHub
AlpacaTag: An Active Learning-based Crowd Annotation Framework for Sequence Tagging (ACL 2019 Demo)
☆137Jan 5, 2023Updated 3 years ago
neomoha / python-lsi-similarity
View on GitHub
A small code in python to compute semantic similarity between documents (or items) using Latent Semantic Indexing
☆14Oct 23, 2014Updated 11 years ago
kermitt2 / entity-fishing
View on GitHub
A machine learning tool for fishing entities
☆268Feb 27, 2026Updated 4 months ago
sudhamstarun / AwesomeNER
View on GitHub
An implementation of bidirectional LSTM-CRF for Named Entity Relationship on custom corpus with custom word embeddings
☆14Apr 9, 2019Updated 7 years ago
nelson-liu / lexical-semantic-recognition
View on GitHub
☆18Jun 12, 2023Updated 3 years ago
ICLRandD / LegalHackers2019
View on GitHub
This repository contains materials for the Open Legal Data Forum at the Legal Hacker 2019 (September 2019 + Brooklyn, NYC)
☆17Dec 8, 2022Updated 3 years ago
nlesc-sherlock / spaCy-dutch
View on GitHub
Repository for creating models, vocabulary and other necessities for Dutch in Spacey
☆11Dec 15, 2016Updated 9 years ago
MaartenGr / PolyFuzz
View on GitHub
Fuzzy string matching, grouping, and evaluation.
☆801Jul 10, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ebanalyse / fuzzup
View on GitHub
A Fuzzy Matching Approach for Clustering Strings
☆31Feb 7, 2023Updated 3 years ago
uds-lsv / NoisyNER
View on GitHub
A dataset for realistic evaluation of noisy label methods
☆15Dec 3, 2023Updated 2 years ago
vi3k6i5 / flashtext
View on GitHub
Extract Keywords from sentence or Replace keywords in sentences.
☆5,716Apr 13, 2025Updated last year
NorskRegnesentral / skweak
View on GitHub
skweak: A software toolkit for weak supervision applied to NLP tasks
☆925Sep 2, 2024Updated last year
CogComp / cogcomp-nlpy
View on GitHub
CogComp's light-weight Python NLP annotators
☆115Feb 18, 2019Updated 7 years ago
alno / batch-learn
View on GitHub
☆49Apr 17, 2018Updated 8 years ago
taleinat / levenshtein-search
View on GitHub
A Javascript library for fuzzy substring search.
☆29Jan 4, 2023Updated 3 years ago
alontalmor / oLMpics
View on GitHub
☆46Jan 26, 2020Updated 6 years ago
gandersen101 / spaczz
View on GitHub
Fuzzy matching and more functionality for spaCy.
☆258Jul 6, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
skececi / gptfree
View on GitHub
Building or integrating an LLM wrapper shouldn't take more than 10 minutes.
☆13Feb 1, 2025Updated last year
yahshibu / nested-ner-tacl2020-flair
View on GitHub
Implementation of Nested Named Entity Recognition using Flair
☆24Oct 29, 2021Updated 4 years ago
labdac / Meta-Prod2Vec
View on GitHub
Repository for experiments with MetaProd2Vec and related algorithms.
☆59Mar 16, 2019Updated 7 years ago
kyunghyuncho / skip-thoughts
View on GitHub
☆23Jun 30, 2015Updated 11 years ago
nipunsadvilkar / pySBD
View on GitHub
🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
☆927Aug 20, 2024Updated last year
scrapinghub / scrapy-mosquitera
View on GitHub
Restrict crawl and scraping scope using matchers.
☆26Jun 8, 2016Updated 10 years ago
pd3f / pd3f-core
View on GitHub
📑 Python Package to reconstruct the original continuous text from PDFs with language models
☆33Sep 8, 2023Updated 2 years ago