6 / stopwords-json
Stopwords for 50 languages in JSON format
☆428Updated last year
Alternatives and similar repositories for stopwords-json:
Users who are interested in stopwords-json are comparing it to the libraries listed below.
- All languages stopwords collection☆427Updated last year
- List of common stop words in various languages.☆332Updated 2 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆160Updated 4 years ago
- Quickly extract multi-word phrases from a corpus☆190Updated 4 years ago
- Automatically exported from code.google.com/p/universal-pos-tags☆129Updated 2 years ago
- Python implementation of TextRank algorithm for automatic keyword extraction and summarization using Levenshtein distance as relation bet…☆773Updated 2 years ago
- Semantic Textual Similarity (STS) measures the degree of equivalence in the underlying semantics of paired snippets of text.☆92Updated 3 years ago
- Multilingual word vectors in 78 languages☆1,195Updated last year
- Python interface to the Stanford Named Entity Recognizer☆291Updated 3 years ago
- Making sense embedding out of word embeddings using graph-based word sense induction☆212Updated 3 years ago
- Awesome-Text-Classification: Projects, Papers, Tutorial.☆170Updated 7 years ago
- GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package al…☆264Updated last year
- Web Content Extraction Through Machine Learning☆185Updated 10 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆236Updated 2 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- ☆215Updated 6 years ago
- Data for Automatic Keyphrase Extraction Task☆337Updated 6 years ago
- SemCor and Masc documents annotated with NOAD word senses.☆182Updated 4 years ago
- Extension of the original word2vec using different architectures☆210Updated 7 years ago
- English stopwords collection☆155Updated 8 years ago
- Quality information extraction at web scale.☆327Updated 7 years ago
- Accurately generate all possible forms of an English word, e.g. "election" --> "elect", "electoral", "electorate", etc.☆629Updated 3 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆335Updated 3 years ago
- spaCy REST API, wrapped in a Docker container.☆266Updated 2 years ago
- Yet another Python binding for fastText☆226Updated 6 years ago
- Python wrapper for Stanford CoreNLP tools v3.4.1☆611Updated 6 years ago
- 💫 REST microservices for various spaCy-related tasks☆240Updated 2 years ago
- ADAM - A Question Answering System. Inspired from IBM Watson☆354Updated 4 years ago
- a Deep Learning based Speller☆225Updated 6 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web☆198Updated 6 years ago