Alir3z4 / stop-wordsLinks
List of common stop words in various languages.
☆337Updated 2 years ago
Alternatives and similar repositories for stop-words
Users that are interested in stop-words are comparing it to the libraries listed below
Sorting:
- Default English stopword lists from many different sources☆308Updated 2 years ago
- All languages stopwords collection☆454Updated last year
- Stopwords for 50 languages in JSON format☆433Updated 2 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆342Updated 3 years ago
- Quickly extract multi-word phrases from a corpus☆194Updated 5 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated 2 years ago
- English stopwords collection☆162Updated 8 years ago
- Get list of common stop words in various languages in Python☆156Updated last year
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆376Updated 2 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆76Updated 3 months ago
- A compound word splitter for Python☆49Updated 4 years ago
- Extended list of German stopwords for use in Web Projects, Search Engines or every thing else.☆105Updated 6 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆182Updated 2 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆239Updated last year
- 📂 Additional lookup tables and data resources for spaCy☆108Updated 3 months ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies.☆749Updated 3 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆254Updated 2 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files☆386Updated last month
- Custom French POS and lemmatizer based on Lefff for spacy☆68Updated 2 years ago
- Named Entity Recognition data for Europeana Newspapers☆173Updated 2 years ago
- A python module for English lemmatization and inflection.☆270Updated 2 years ago
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.☆632Updated 4 years ago
- Automatically exported from code.google.com/p/universal-pos-tags☆130Updated 3 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆147Updated 9 months ago
- Universal Dependencies online documentation☆289Updated this week
- Fixes contractions such as `you're` to `you are`☆317Updated 2 years ago
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- AFINN sentiment analysis in Python☆467Updated 3 years ago
- ☆129Updated 3 years ago