stopwords-iso / stopwords-en
English stopwords collection
☆159 Updated 8 years ago
Alternatives and similar repositories for stopwords-en:
Users who are interested in stopwords-en compare it to the libraries listed below.
- Default English stopword lists from many different sources ☆298 Updated last year
- All languages stopwords collection ☆437 Updated last year
- List of common stop words in various languages. ☆337 Updated 2 years ago
- Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and othe… ☆114 Updated 5 years ago
- LexRank algorithm for text summarization ☆231 Updated 11 months ago
- Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms. ☆146 Updated 4 years ago
- ☆208 Updated 4 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python ☆268 Updated last year
- Semantic Orientation Calculator for Sentiment Analysis ☆52 Updated 2 years ago
- A multilingual lexicon of words to hurt. ☆87 Updated 4 months ago
- Quickly extract multi-word phrases from a corpus ☆191 Updated 4 years ago
- The SentiWordNet sentiment lexicon ☆327 Updated 2 years ago
- Scrape news articles and analyze them using NLP to quantify the gender gap in Canadian mainstream media ☆42 Updated 10 months ago
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation) ☆435 Updated last year
- A spaCy wrapper for DBpedia Spotlight ☆109 Updated 2 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co… ☆314 Updated 3 years ago
- Named Entity Recognition data for Europeana Newspapers ☆171 Updated last year
- A Python wrapper around the topic modeling functions of MALLET. ☆101 Updated 5 months ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more ☆373 Updated 6 months ago
- AmbiverseNLU: A Natural Language Understanding suite by Max Planck Institute for Informatics ☆210 Updated last year
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure* ☆75 Updated last year
- A Python implementation of the Rapid Automatic Keyword Extraction ☆373 Updated 7 years ago
- A machine learning tool for fishing entities ☆263 Updated this week
- Linguistic Inquiry and Word Count (LIWC) analyzer ☆208 Updated 3 years ago
- Guidelines. ☆96 Updated 7 months ago
- Terminology Extraction Program (English Version) ☆43 Updated 8 months ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆377 Updated 4 months ago
- A Java UIMA-based toolbox for multilingual and efficient terminology extraction and multilingual term alignment ☆38 Updated 7 years ago
- Keyword extraction with Word2Vec ☆46 Updated 4 years ago
- A Python function to break down hashtags or compound words created by putting together multiple words ☆33 Updated 9 years ago