taleinat / fuzzysearch
Find parts of long text or data, allowing for some changes/typos.
☆324 Updated last month
Alternatives and similar repositories for fuzzysearch
Users who are interested in fuzzysearch are comparing it to the libraries listed below.
- Text tokenization and sentence segmentation (segtok v2) ☆205 Updated 3 years ago
- ☆171 Updated 3 months ago
- Textpipe: clean and extract metadata from text ☆302 Updated 4 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆831 Updated 2 months ago
- Fuzzy matching and more functionality for spaCy. ☆256 Updated 11 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆152 Updated 2 years ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ☆747 Updated 2 weeks ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆247 Updated last year
- Fuzzy string matching, grouping, and evaluation. ☆764 Updated last month
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆170 Updated 3 years ago
- Abydos NLP/IR library for Python ☆186 Updated 2 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle) ☆469 Updated 5 months ago
- Sentence transformers models for SpaCy ☆107 Updated 2 years ago
- Python library for Natural Language Preprocessing (NLPre) ☆191 Updated last year
- 📂 Additional lookup tables and data resources for spaCy ☆105 Updated 3 weeks ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆114 Updated 3 months ago
- Python interface to Apache PDFBox command-line tools. ☆75 Updated 2 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance. ☆77 Updated 3 years ago
- A simple client for doccano API. ☆85 Updated last year
- Python wrapper for Stanford CoreNLP's SUTime ☆154 Updated 2 years ago
- ☆517 Updated last month
- A python module for word inflections designed for use with spaCy. ☆92 Updated 5 years ago
- Find strings/words in text; convenience and C speed ☆126 Updated 2 years ago
- A Fast Levenshtein Distance Library for Python ☆83 Updated 4 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… ☆195 Updated 2 years ago
- spellchecking library for python ☆610 Updated last year
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface ☆259 Updated 9 months ago
- Library for unit extraction - fork of quantulum for python3 ☆141 Updated last year
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆73 Updated last month
- A python module for English lemmatization and inflection. ☆268 Updated last year