fujimotos / polyleven
A Fast Levenshtein Distance Library for Python
β82Updated 2 months ago
Alternatives and similar repositories for polyleven:
Users that are interested in polyleven are comparing it to the libraries listed below
- Confection: the sweetest config system for Pythonβ186Updated 2 weeks ago
- πΈ fastText + Bloom embeddings for compact, full-coverage vectors with spaCyβ310Updated last year
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacingβ71Updated 2 months ago
- Super lightweight function registries for your libraryβ179Updated 10 months ago
- π€ Push your spaCy pipelines to the Hugging Face Hubβ43Updated 10 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ114Updated last month
- π§ͺ Cutting-edge experimental spaCy components and featuresβ98Updated last year
- π¦ Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)β463Updated 3 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.β153Updated 11 months ago
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 3 years ago
- β69Updated 3 years ago
- string embed for fast edit distance computation, codes for [Convolutional Embedding for Edit Distance (SIGIR 20)].β61Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated last year
- Library for unit extraction - fork of quantulum for python3β138Updated 10 months ago
- Fuzzy matching and more functionality for spaCy.β256Updated 9 months ago
- Abydos NLP/IR library for Pythonβ185Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressionsβ54Updated 2 years ago
- Bag of, not words, but tricks!β68Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to iβ¦β46Updated last year
- Sentence transformers models for SpaCyβ107Updated 2 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)β151Updated last year
- β43Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entiβ¦β245Updated last year
- Custom Natural Language Processing with big and small models π²π±β68Updated 3 years ago
- Repo originally for a talk at Normconfβ21Updated 2 years ago
- Find strings/words in text; convenience and C speedβ126Updated 2 years ago
- A python package to simulate typographical errors.β34Updated last year
- Parse natural language time expressions in pythonβ130Updated 2 years ago
- A Streamlit component for annotating text by text selecting.β40Updated 10 months ago
- Text tokenization and sentence segmentation (segtok v2)β201Updated 3 years ago