keredson / wordninja
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
☆848Updated 2 years ago
Alternatives and similar repositories for wordninja
Users that are interested in wordninja are comparing it to the libraries listed below
Sorting:
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆375Updated 2 years ago
- Python Keyphrase Extraction module☆1,581Updated last year
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆825Updated 2 weeks ago
- Single-document unsupervised keyword extraction☆1,721Updated 2 months ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆744Updated 2 months ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆731Updated 8 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,383Updated 3 months ago
- Compute Sentence Embeddings Fast!☆623Updated 2 years ago
- Fuzzy string matching, grouping, and evaluation.☆761Updated last week
- Beautiful visualizations of how language differs among document types.☆2,301Updated 2 weeks ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆413Updated 3 months ago
- A python module for English lemmatization and inflection.☆268Updated last year
- 🦆 Contextually-keyed word vectors☆1,650Updated 3 weeks ago
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆850Updated 8 months ago
- Multilingual text (NLP) processing toolkit☆2,335Updated last year
- A library implementing different string similarity and distance measures using Python.☆1,007Updated 2 years ago
- TextRank implementation for Python 3.☆1,256Updated 2 years ago
- NLP, before and after spaCy☆2,225Updated last year
- spellchecking library for python☆609Updated 10 months ago
- NeuSpell: A Neural Spelling Correction Toolkit☆693Updated last year
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆268Updated last year
- A python binding for crfsuite☆773Updated 7 months ago
- A tool for learning vector representations of words and entities from Wikipedia☆955Updated last year
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,173Updated 9 months ago
- Port of Google's language-detection library to Python.☆1,796Updated 2 months ago
- Fixes contractions such as `you're` to `you are`☆318Updated 2 years ago
- Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.☆1,066Updated 2 years ago
- PYthon Automated Term Extraction☆311Updated 2 years ago
- ☆170Updated last month
- 💫 Models for the spaCy Natural Language Processing (NLP) library☆1,739Updated 7 months ago