ddelange / retrieLinks
Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing
☆73Updated last month
Alternatives and similar repositories for retrie
Users that are interested in retrie are comparing it to the libraries listed below
Sorting:
- Super lightweight function registries for your library☆179Updated last year
- Confection: the sweetest config system for Python☆186Updated 2 months ago
- Language detection using Spacy and Fasttext☆55Updated last year
- Bag of, not words, but tricks!☆68Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆155Updated last year
- A Python implementation of Lunr.js 🌖☆197Updated 3 months ago
- 🕊️ Radically lightweight command-line interfaces☆107Updated 2 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆470Updated 5 months ago
- Python package for deduplication/entity resolution using active learning☆80Updated 9 months ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- Rust-based Python wrapper for duckling library in Haskell☆25Updated 4 years ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆72Updated last year
- Multi-Langauge Identification☆28Updated 10 months ago
- 🧪 Cutting-edge experimental spaCy components and features☆99Updated last year
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated last year
- A python package to simulate typographical errors.☆35Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- Extract text from HTML☆134Updated 4 years ago
- Find strings/words in text; convenience and C speed☆126Updated 2 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year
- Fuzzy matching and more functionality for spaCy.☆256Updated 11 months ago
- Efficient string matching with regular expressions☆143Updated this week
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- ☆70Updated 2 years ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- ☆69Updated 3 years ago
- An open-source package for python to clean raw text data☆70Updated last year
- A Streamlit component for annotating text by text selecting.☆40Updated last year