taleinat / fuzzysearchLinks
Find parts of long text or data, allowing for some changes/typos.
☆319Updated last week
Alternatives and similar repositories for fuzzysearch
Users that are interested in fuzzysearch are comparing it to the libraries listed below
Sorting:
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆247Updated last year
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 3 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆827Updated last month
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆153Updated last year
- Super lightweight function registries for your library☆179Updated last year
- A python module for English lemmatization and inflection.☆268Updated last year
- Spelling corrector in python☆482Updated 5 months ago
- Find strings/words in text; convenience and C speed☆126Updated 2 years ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆744Updated 2 weeks ago
- ☆171Updated 2 months ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆375Updated 2 years ago
- 📂 Additional lookup tables and data resources for spaCy☆105Updated this week
- A fast and memory-optimized string library for heavy-text manipulation in Python☆250Updated 5 years ago
- Text tokenization and sentence segmentation (segtok v2)☆205Updated 3 years ago
- Fuzzy matching and more functionality for spaCy.☆256Updated 11 months ago
- Python interface to Apache PDFBox command-line tools.☆75Updated 2 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆114Updated 3 months ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆468Updated 4 months ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆413Updated 4 months ago
- Python wrapper for Stanford CoreNLP's SUTime☆154Updated 2 years ago
- Python3 implementation of the Schwartz-Hearst algorithm for extracting abbreviation-definition pairs☆88Updated last year
- LASER multilingual sentence embeddings as a pip package☆223Updated last year
- Parse numbers written in natural language☆116Updated 7 months ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆77Updated 3 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆73Updated last month
- Textpipe: clean and extract metadata from text☆302Updated 3 years ago
- Pythonic search engine based on PyLucene.☆127Updated 6 months ago
- NER, syntax markup visualizations☆139Updated last year
- Fixes contractions such as `you're` to `you are`☆318Updated 2 years ago