fujimotos / polyleven
A Fast Levenshtein Distance Library for Python
★82, updated 2 weeks ago
Alternatives and similar repositories for polyleven:
Users who are interested in polyleven are comparing it to the libraries listed below.
- Super lightweight function registries for your library (★177, updated 9 months ago)
- Confection: the sweetest config system for Python (★183, updated 9 months ago)
- fastText + Bloom embeddings for compact, full-coverage vectors with spaCy (★309, updated last year)
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity (★110, updated last week)
- Push your spaCy pipelines to the Hugging Face Hub (★43, updated 9 months ago)
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing (★69, updated last month)
- Annotation tool on Jupyter for Named Entity Recognition tasks (★21, updated last year)
- Cutting-edge experimental spaCy components and features (★96, updated 10 months ago)
- Library for unit extraction - fork of quantulum for python3 (★136, updated 8 months ago)
- Sentence transformers models for SpaCy (★107, updated 2 years ago)
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … (★106, updated last year)
- Fuzzy matching and more functionality for spaCy. (★255, updated 8 months ago)
- Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle) (★457, updated last month)
- Abydos NLP/IR library for Python (★185, updated 2 years ago)
- A Python implementation of Lunr.js (★196, updated this week)
- (★42, updated last year)
- Augmenty is an augmentation library based on spaCy for augmenting texts. (★151, updated 9 months ago)
- NLPiper is a package that agglomerates different NLP tools and applies their transformations in the target document. (★18, updated last year)
- A spaCy custom component that extracts and normalizes temporal expressions (★54, updated 2 years ago)
- Python package for deduplication/entity resolution using active learning (★76, updated 6 months ago)
- Few-shot Named Entity Recognition (★123, updated 2 years ago)
- Blue Brain text mining toolbox for semantic search and structured information extraction (★44, updated last year)
- (★68, updated 2 years ago)
- Vectorizers for a range of different data types (★101, updated last month)
- A comprehensive and scalable set of string tokenizers and similarity measures in Python (★136, updated 7 months ago)
- A VS Code extension for annotating data with Prodigy (★30, updated 3 years ago)
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti… (★244, updated last year)
- super fast cpp implementation of longest common subsequence/substring (★24, updated last year)
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… (★193, updated 2 years ago)
- Bag of, not words, but tricks! (★68, updated last year)