taleinat / fuzzysearch
Find parts of long text or data, allowing for some changes/typos.
β318Updated 9 months ago
Alternatives and similar repositories for fuzzysearch:
Users that are interested in fuzzysearch are comparing it to the libraries listed below
- β169Updated last month
- Lightning Fast Language Prediction πβ166Updated 6 years ago
- π Additional lookup tables and data resources for spaCyβ105Updated 3 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ114Updated 2 months ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/β741Updated 2 months ago
- Textpipe: clean and extract metadata from textβ301Updated 3 years ago
- Find strings/words in text; convenience and C speedβ126Updated 2 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic feβ¦β169Updated 3 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.β77Updated 3 years ago
- A python module for word inflections designed for use with spaCy.β92Updated 5 years ago
- Parse numbers written in natural languageβ114Updated 6 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacingβ71Updated last week
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithmβ¦β824Updated 2 weeks ago
- A Fast Levenshtein Distance Library for Pythonβ83Updated 2 months ago
- python library to simplify working with jsonlines and ndjson dataβ293Updated 9 months ago
- βοΈContextual word checker for better suggestions (not actively maintained)β413Updated 3 months ago
- Spelling corrector in pythonβ480Updated 4 months ago
- Language independent truecaser in Python.β160Updated 3 years ago
- A fully customisable language detection pipeline for spaCyβ92Updated 6 years ago
- β500Updated 2 months ago
- π¦ Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)β465Updated 3 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)β151Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiencyβ154Updated 5 months ago
- A python true casing utility that restores case information for textsβ88Updated 2 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engineβ186Updated 4 years ago
- Levenshtein and Hamming distance computationβ116Updated 5 years ago
- Google USE (Universal Sentence Encoder) for spaCyβ184Updated 2 years ago
- A fast and memory-optimized string library for heavy-text manipulation in Pythonβ250Updated 5 years ago
- Fuzzy matching and more functionality for spaCy.β256Updated 10 months ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.β316Updated 2 months ago