chrislit / abydosLinks
Abydos NLP/IR library for Python
β193Updated 3 years ago
Alternatives and similar repositories for abydos
Users that are interested in abydos are comparing it to the libraries listed below
Sorting:
- Lightning Fast Language Prediction πβ167Updated 3 months ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Pythonβ142Updated last year
- Parse natural language time expressions in pythonβ131Updated 3 years ago
- Hunspell extension for spaCy 2.0.β94Updated last year
- A Python module to convert natural language numerics into ints and floats.β233Updated last year
- Textpipe: clean and extract metadata from textβ302Updated 4 years ago
- Fuzzy matching and more functionality for spaCy.β259Updated last year
- Library for unit extraction - fork of quantulum for python3β145Updated last year
- πΈ fastText + Bloom embeddings for compact, full-coverage vectors with spaCyβ327Updated 7 months ago
- π Additional lookup tables and data resources for spaCyβ113Updated 6 months ago
- Language detection extension for spaCy 2.0+β114Updated 6 years ago
- π¦ Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)β478Updated 2 weeks ago
- β69Updated 3 years ago
- A Python 3 phonetics library.β136Updated 5 years ago
- Text tokenization and sentence segmentation (segtok v2)β208Updated 3 years ago
- Group thousands of similar spreadsheet or database text entries in secondsβ157Updated 2 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacingβ77Updated this week
- PYthon Automated Term Extractionβ317Updated 2 years ago
- Super Fast String Matching in Pythonβ372Updated 8 months ago
- A compound word splitter for Pythonβ49Updated 4 years ago
- π Emoji handling and meta data for spaCy with custom extension attributesβ182Updated 2 years ago
- Dataframe Integration with spaCy.β103Updated 4 years ago
- Super lightweight function registries for your libraryβ180Updated last year
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic feβ¦β170Updated 3 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)β155Updated 2 years ago
- Scalable String Similarity Joins in Pythonβ39Updated last year
- Sentence transformers models for SpaCyβ109Updated 2 years ago
- βοΈContextual word checker for better suggestions (not actively maintained)β418Updated 10 months ago
- A tokenizer and sentence splitter for German and English web and social media texts.β150Updated 11 months ago
- Information extraction from English and German texts based on predicate logicβ139Updated 2 years ago