stopwords-iso / stopwords-en
English stopwords collection
☆160 Updated 8 years ago
Alternatives and similar repositories for stopwords-en:
Users who are interested in stopwords-en are comparing it to the libraries listed below.
- Default English stopword lists from many different sources ☆298 Updated 2 years ago
- Quickly extract multi-word phrases from a corpus ☆191 Updated 4 years ago
- Machine-readable lists of lemma-token pairs in 23 languages. ☆336 Updated 3 years ago
- All languages stopwords collection ☆439 Updated last year
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more ☆375 Updated 7 months ago
- List of common stop words in various languages. ☆337 Updated 2 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface ☆255 Updated 7 months ago
- Entity linking system for Wikidata updated by your edits in real time ☆254 Updated 4 months ago
- Language independent truecaser in Python. ☆160 Updated 3 years ago
- ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N, L, S, SU], stemming and stopwords in different languages, unicode… ☆215 Updated 5 years ago
- LexRank algorithm for text summarization ☆230 Updated last year
- Automatically exported from code.google.com/p/universal-pos-tags ☆129 Updated 2 years ago
- Universal Dependencies online documentation ☆282 Updated this week
- Named Entity Recognition based on dictionaries ☆242 Updated 6 years ago
- Entity Linking system by A3 lab ☆68 Updated 6 years ago
- A multilingual lexicon of words to hurt. ☆89 Updated 5 months ago
- Implementation of the ClausIE information extraction system for python+spacy ☆222 Updated 2 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html ☆138 Updated 2 years ago
- Enhanced Subject Word Object Extraction ☆151 Updated 3 weeks ago
- Guidelines. ☆96 Updated 8 months ago
- Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in … ☆128 Updated 5 years ago
- Linguistic Inquiry and Word Count (LIWC) analyzer ☆210 Updated 3 years ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities. ☆352 Updated 2 years ago
- Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms. ☆146 Updated 4 years ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t… ☆219 Updated 9 months ago
- ☆208 Updated 4 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector ☆161 Updated 4 years ago
- Palmetto is a quality measuring tool for topics ☆216 Updated last year
- GSDMM: Short text clustering ☆355 Updated 2 years ago
- Named Entity Recognition data for Europeana Newspapers ☆171 Updated 2 years ago