6 / stopwords-json
Stopwords for 50 languages in JSON format
☆429 · Updated last year
Alternatives and similar repositories for stopwords-json:
Users who are interested in stopwords-json are comparing it to the libraries listed below:
- All languages stopwords collection · ☆437 · Updated last year
- List of common stop words in various languages. · ☆337 · Updated 2 years ago
- displaCy.js: An open-source NLP visualiser for the modern web · ☆344 · Updated 6 years ago
- TextRank implementation for Python 3. · ☆1,255 · Updated 2 years ago
- spaCy REST API, wrapped in a Docker container. · ☆267 · Updated 2 years ago
- Yet another Python binding for fastText · ☆226 · Updated 6 years ago
- A collection of functions that measure the readability of a given body of text · ☆191 · Updated 7 years ago
- Code for the word2vec HTTP server running at https://rare-technologies.com/word2vec-tutorial/#bonus_app · ☆158 · Updated 7 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web · ☆199 · Updated 6 years ago
- Data for Automatic Keyphrase Extraction Task · ☆336 · Updated 6 years ago
- ADAM - A Question Answering System. Inspired from IBM Watson · ☆355 · Updated 5 years ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies. · ☆746 · Updated 2 years ago
- SymSpellCompound: compound aware automatic spelling correction · ☆66 · Updated 7 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files · ☆378 · Updated 4 months ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html · ☆138 · Updated 2 years ago
- word2vec Google News model slimmed down to 300k English words · ☆215 · Updated 7 years ago
- SemCor and Masc documents annotated with NOAD word senses. · ☆183 · Updated 5 years ago
- Natural Language Engine on WikiData · ☆436 · Updated 8 years ago
- Automatically exported from code.google.com/p/universal-pos-tags · ☆129 · Updated 2 years ago
- GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package al… · ☆265 · Updated 2 years ago
- A simple and fast discriminative sequence labeling toolkit (http://wapiti.limsi.fr) · ☆252 · Updated 2 years ago
- Fast, DB Backed pretrained word embeddings for natural language processing. · ☆222 · Updated last year
- CRF to detect named entities (primarily names of people) · ☆119 · Updated 7 years ago
- Code and pre-trained model for: Deep Semantic Role Labeling: What Works and What's Next · ☆332 · Updated 6 years ago
- A Python implementation of the Rapid Automatic Keyword Extraction algorithm · ☆373 · Updated 7 years ago
- Making sense embedding out of word embeddings using graph-based word sense induction · ☆213 · Updated 3 years ago
- Different datasets for developing and testing keyword extraction algorithms · ☆109 · Updated 9 years ago
- An example application using Word2Vec. Given a list of words, it finds the one which isn't 'like' the others - a typical language underst… · ☆288 · Updated 11 years ago
- Bitextor generates translation memories from multilingual websites · ☆292 · Updated 4 months ago
- Named Entity Recognition Tool · ☆1,164 · Updated 5 years ago