indix / whatthelang
Lightning Fast Language Prediction
☆167 · Updated 6 years ago
Alternatives and similar repositories for whatthelang
Users who are interested in whatthelang are comparing it to the libraries listed below.
- Emoji handling and meta data for spaCy with custom extension attributes ☆181 · Updated 2 years ago
- A fully customisable language detection pipeline for spaCy ☆93 · Updated 6 years ago
- Language detection extension for spaCy 2.0+ ☆113 · Updated 6 years ago
- Hunspell extension for spaCy 2.0. ☆94 · Updated last year
- A python module for word inflections designed for use with spaCy. ☆92 · Updated 5 years ago
- Textpipe: clean and extract metadata from text ☆302 · Updated 4 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆170 · Updated 3 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance. ☆77 · Updated 3 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆154 · Updated 2 years ago
- spaCy + UDPipe ☆162 · Updated 3 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more. ☆98 · Updated 4 years ago
- Convert number words (eg. twenty one) to numeric digits (21) ☆177 · Updated last year
- Misspelling Oblivious Word Embeddings ☆201 · Updated 6 years ago
- Language independent truecaser in Python. ☆159 · Updated 3 years ago
- A compound word splitter for Python ☆48 · Updated 3 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆205 · Updated 3 years ago
- Query spaCy's linguistic annotations using GraphQL ☆86 · Updated 7 years ago
- ☆172 · Updated 4 months ago
- Abydos NLP/IR library for Python ☆188 · Updated 2 years ago
- Named Entity Recognition data for Europeana Newspapers ☆171 · Updated 2 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG. ☆105 · Updated 2 years ago
- ☆70 · Updated 2 years ago
- Additional lookup tables and data resources for spaCy ☆108 · Updated 2 months ago
- Fast supervised sentence boundary detection using the averaged perceptron ☆90 · Updated 6 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities ☆117 · Updated last month
- Use ML-Annotate to label data for machine learning purposes ☆110 · Updated 5 years ago
- A python true casing utility that restores case information for texts ☆89 · Updated 2 years ago
- Language Tool style grammar handling with spaCy 2.0 ☆42 · Updated 7 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects. ☆56 · Updated 6 years ago
- Character-based word embeddings model based on RNN for handling real world texts ☆173 · Updated last year