chrislit / abydos
Abydos NLP/IR library for Python
☆186 Updated 2 years ago
Alternatives and similar repositories for abydos
Users who are interested in abydos are comparing it to the libraries listed below.
- A comprehensive and scalable set of string tokenizers and similarity measures in Python ☆140 Updated 11 months ago
- Fuzzy matching and more functionality for spaCy. ☆256 Updated last year
- Hunspell extension for spaCy 2.0. ☆94 Updated 11 months ago
- A Python module to convert natural language numerics into ints and floats. ☆228 Updated 9 months ago
- Library for unit extraction - fork of quantulum for python3 ☆141 Updated last year
- 📂 Additional lookup tables and data resources for spaCy ☆105 Updated last month
- Information extraction from English and German texts based on predicate logic ☆137 Updated 2 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy ☆314 Updated 2 months ago
- Textpipe: clean and extract metadata from text ☆302 Updated 4 years ago
- Lightning Fast Language Prediction 🚀 ☆167 Updated 6 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆205 Updated 3 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆73 Updated last week
- Super Fast String Matching in Python ☆370 Updated 3 months ago
- Dataframe Integration with spaCy. ☆103 Updated 4 years ago
- Group thousands of similar spreadsheet or database text entries in seconds ☆156 Updated 2 years ago
- Sentence transformers models for SpaCy ☆107 Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆166 Updated last month
- Language detection extension for spaCy 2.0+ ☆113 Updated 6 years ago
- A compound word splitter for Python ☆48 Updated 3 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆152 Updated 2 years ago
- A Python 3 phonetics library. ☆133 Updated 5 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆190 Updated 2 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ☆68 Updated 2 years ago
- ☆69 Updated 3 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪 ☆77 Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆217 Updated 5 months ago
- spaCy + UDPipe ☆161 Updated 3 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes ☆181 Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts. ☆146 Updated 7 months ago
- PYthon Automated Term Extraction ☆314 Updated 2 years ago