fujimotos / polyleven
Fast Levenshtein Distance Library for Python 3
☆79Updated last year
Related projects: ⓘ
- Confection: the sweetest config system for Python☆175Updated 3 months ago
- ☆46Updated this week
- Super lightweight function registries for your library☆172Updated 3 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆66Updated 2 weeks ago
- Fuzzy matching and more functionality for spaCy.☆249Updated 2 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆91Updated 5 months ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆425Updated 2 months ago
- Abydos NLP/IR library for Python☆180Updated last year
- 🤗 Push your spaCy pipelines to the Hugging Face Hub☆42Updated 3 months ago
- Library for unit extraction - fork of quantulum for python3☆134Updated 2 months ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆283Updated 10 months ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 2 years ago
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug…☆41Updated 3 years ago
- Find parts of long text or data, allowing for some changes/typos.☆302Updated last month
- Bag of, not words, but tricks!☆68Updated 10 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆149Updated 3 months ago
- ☆65Updated 2 years ago
- Sentence transformers models for SpaCy☆104Updated last year
- Annotation tool on Jupyter for Named Entity Recognition tasks☆21Updated 6 months ago
- A Python module to convert natural language numerics into ints and floats.☆211Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 6 months ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆160Updated 2 weeks ago
- ☆41Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆148Updated last year
- A Vectorized Python Dict/Set☆115Updated last year
- A Python implementation of Lunr.js 🌖☆188Updated last week
- Python 3 library to store memory mappable objects into pickle-compatible files☆37Updated 6 years ago
- A purely-functional HTML builder for Python. Think JSX rather than templates.☆92Updated last month
- Python package for deduplication/entity resolution using active learning☆77Updated 3 weeks ago
- Few-shot Named Entity Recognition☆121Updated 2 years ago