fujimotos / polyleven
Fast Levenshtein Distance Library for Python 3
☆82Updated 2 years ago
Alternatives and similar repositories for polyleven:
Users that are interested in polyleven are comparing it to the libraries listed below
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆109Updated last month
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆68Updated 2 weeks ago
- Find parts of long text or data, allowing for some changes/typos.☆313Updated 6 months ago
- Confection: the sweetest config system for Python☆182Updated 8 months ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub☆44Updated 8 months ago
- Annotation tool on Jupyter for Named Entity Recognition tasks☆21Updated 11 months ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆307Updated last year
- Super lightweight function registries for your library☆177Updated 8 months ago
- Sentence transformers models for SpaCy☆107Updated last year
- ☆168Updated 8 months ago
- A python package to simulate typographical errors.☆31Updated last year
- Library for unit extraction - fork of quantulum for python3☆136Updated 7 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 11 months ago
- Vectorizers for a range of different data types☆100Updated last week
- 🧪 Cutting-edge experimental spaCy components and features☆96Updated 9 months ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆168Updated 3 months ago
- Fuzzy matching and more functionality for spaCy.☆254Updated 7 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- ☆42Updated last year
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug…☆42Updated 4 years ago
- ☆68Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 8 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆157Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2)☆201Updated 2 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆298Updated last month
- Python text processing, pattern matching, and NLP framework☆63Updated last year
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆449Updated last month
- NLPiper is a package that agglomerates different NLP tools and applies their transformations in the target document.☆18Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- Abydos NLP/IR library for Python☆184Updated 2 years ago