fujimotos / polyleven
A Fast Levenshtein Distance Library for Python
☆83 · Updated 2 months ago
Alternatives and similar repositories for polyleven
Users who are interested in polyleven are comparing it to the libraries listed below.
- Confection: the sweetest config system for Python ☆186 · Updated last month
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy ☆311 · Updated 3 weeks ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆114 · Updated 2 months ago
- Fuzzy matching and more functionality for spaCy. ☆256 · Updated 10 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆73 · Updated 2 weeks ago
- Super lightweight function registries for your library ☆179 · Updated 11 months ago
- Library for unit extraction - fork of quantulum for python3 ☆138 · Updated 10 months ago
- Sentence transformers models for SpaCy ☆107 · Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆161 · Updated 2 years ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub ☆44 · Updated 11 months ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆54 · Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆214 · Updated 3 months ago
- Find parts of long text or data, allowing for some changes/typos. ☆318 · Updated 9 months ago
- ☆69 · Updated 3 years ago
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug… ☆43 · Updated 4 years ago
- ☆43 · Updated 2 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 · Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 · Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆154 · Updated 11 months ago
- Python package for deduplication/entity resolution using active learning ☆79 · Updated 8 months ago
- A Streamlit component for annotating text by text selecting. ☆40 · Updated 11 months ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python ☆138 · Updated 10 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti… ☆245 · Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 · Updated 3 years ago
- Abydos NLP/IR library for Python ☆186 · Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆65 · Updated 2 years ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13 ☆179 · Updated last month
- Bag of, not words, but tricks! ☆68 · Updated last year
- Text tokenization and sentence segmentation (segtok v2) ☆202 · Updated 3 years ago
- Few-shot Named Entity Recognition ☆123 · Updated 3 years ago