Alir3z4 / stop-words
List of common stop words in various languages.
☆332 Updated 2 years ago
Alternatives and similar repositories for stop-words:
Users who are interested in stop-words are comparing it to the libraries listed below.
- All languages stopwords collection ☆427 Updated last year
- English stopwords collection ☆155 Updated 8 years ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies. ☆747 Updated 2 years ago
- Default English stopword lists from many different sources ☆294 Updated last year
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python ☆269 Updated last year
- Stopwords for 50 languages in JSON format ☆428 Updated last year
- Python implementation of TextRank algorithm for automatic keyword extraction and summarization using Levenshtein distance as relation bet… ☆773 Updated 2 years ago
- Various Algorithms for Short Text Mining ☆466 Updated last week
- Lightning Fast Language Prediction 🚀 ☆165 Updated 5 years ago
- Extract dates from text ☆64 Updated 4 years ago
- Quickly extract multi-word phrases from a corpus ☆190 Updated 4 years ago
- Automatically exported from code.google.com/p/universal-pos-tags ☆129 Updated 2 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso… ☆236 Updated 5 months ago
- 📂 Additional lookup tables and data resources for spaCy ☆99 Updated this week
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface ☆253 Updated 4 months ago
- ☆167 Updated 7 months ago
- A python implementation of the Rapid Automatic Keyword Extraction ☆972 Updated 4 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid. ☆75 Updated 3 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine ☆186 Updated 3 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati… ☆664 Updated 11 months ago
- HackDelft ☆81 Updated 7 years ago
- Generating Vectors for DBpedia Entities via Word2Vec and Wikipedia Dumps. Questions? https://gitter.im/idio-opensource/Lobby ☆600 Updated 7 years ago
- Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and othe… ☆114 Updated 4 years ago
- Get list of common stop words in various languages in Python ☆156 Updated 10 months ago
- A compound word splitter for Python ☆48 Updated 3 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆368 Updated 2 years ago
- PYthon Automated Term Extraction ☆310 Updated last year
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary. ☆310 Updated 2 weeks ago
- Named Entity Recognition data for Europeana Newspapers ☆171 Updated last year
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy ☆730 Updated 5 months ago