ddelange / retrie
Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing
☆69Updated last month
Alternatives and similar repositories for retrie:
Users that are interested in retrie are comparing it to the libraries listed below
- Multi-Langauge Identification☆29Updated 7 months ago
- Abydos NLP/IR library for Python☆185Updated 2 years ago
- ☆30Updated 2 years ago
- Super lightweight function registries for your library☆177Updated 9 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆150Updated last year
- A Python implementation of Lunr.js 🌖☆196Updated last week
- Confection: the sweetest config system for Python☆183Updated 9 months ago
- Bag of, not words, but tricks!☆68Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆97Updated 10 months ago
- Efficient string matching with regular expressions☆141Updated last week
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- 🕊️ Radically lightweight command-line interfaces☆105Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 9 months ago
- ☆68Updated 3 years ago
- Generate reports for spaCy models.☆29Updated 2 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- A python package to simulate typographical errors.☆32Updated last year
- ☆70Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆67Updated 2 years ago
- Python package for deduplication/entity resolution using active learning☆76Updated 6 months ago
- Rust-based Python wrapper for duckling library in Haskell☆25Updated 4 years ago
- Language detection using Spacy and Fasttext☆55Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 11 months ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆62Updated last week
- A compound word splitter for Python☆48Updated 3 years ago
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- Finds linguistic patterns effortlessly☆35Updated last year