Alir3z4 / stop-wordsLinks
List of common stop words in various languages.
☆337Updated 2 years ago
Alternatives and similar repositories for stop-words
Users that are interested in stop-words are comparing it to the libraries listed below
Sorting:
- Default English stopword lists from many different sources☆303Updated 2 years ago
- All languages stopwords collection☆451Updated last year
- English stopwords collection☆162Updated 8 years ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆385Updated 10 months ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆376Updated 2 years ago
- ☆129Updated 3 years ago
- Elegant and Easy Tweet Preprocessing in Python☆308Updated 2 years ago
- AFINN sentiment analysis in Python☆465Updated 3 years ago
- Stopwords for 50 languages in JSON format☆430Updated 2 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated last year
- Get list of common stop words in various languages in Python☆156Updated last year
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆75Updated last month
- Python Implementations of Word Sense Disambiguation (WSD) Technologies.☆748Updated 2 years ago
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.☆634Updated 4 years ago
- Python wrapper for LanguageTool grammar checker☆329Updated 3 years ago
- Extract dates from text☆64Updated 4 years ago
- Fixes contractions such as `you're` to `you are`☆318Updated 2 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆524Updated 8 months ago
- The SentiWordNet sentiment lexicon☆331Updated 3 years ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆239Updated 10 months ago
- Python implementation of TextRank algorithm for automatic keyword extraction and summarization using Levenshtein distance as relation bet…☆780Updated 3 years ago
- 📂 Additional lookup tables and data resources for spaCy☆107Updated last month
- Quickly extract multi-word phrases from a corpus☆191Updated 5 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆341Updated 3 years ago
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated 2 years ago
- Wikidata client library for Python☆355Updated last year
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati…☆670Updated last month
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆153Updated 2 years ago