piskvorky / gensim-dataLinks
Data repository for pretrained NLP models and NLP corpora.
☆1,037Updated 7 years ago
Alternatives and similar repositories for gensim-data
Users that are interested in gensim-data are comparing it to the libraries listed below
Sorting:
- General purpose unsupervised sentence representations☆1,204Updated 3 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati…☆671Updated 4 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,396Updated 4 months ago
- A python tool for evaluating the quality of sentence embeddings.☆2,108Updated last year
- Super easy library for BERT based NLP models☆1,909Updated last year
- Overview of Modern Deep Learning Techniques Applied to Natural Language Processing☆1,327Updated 5 years ago
- word2vec Google News model☆528Updated 5 years ago
- ☆1,308Updated 3 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,216Updated last year
- Text Similarity☆398Updated 5 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆739Updated last year
- InferSent sentence embeddings☆2,279Updated 4 years ago
- semi supervised guided topic model with custom guidedLDA☆511Updated 6 months ago
- Python Keyphrase Extraction module☆1,584Updated 2 years ago
- Python scripts for training/testing paragraph vectors☆649Updated last month
- Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx☆640Updated 4 years ago
- Compute Sentence Embeddings Fast!☆623Updated 2 years ago
- jiant is an nlp toolkit☆1,668Updated 2 years ago
- Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regr…☆1,180Updated 4 years ago
- A curated list of resources dedicated to text summarization☆1,541Updated 2 years ago
- A curated list of pretrained sentence and word embedding models☆2,278Updated 4 years ago
- NLP, before and after spaCy☆2,230Updated 2 years ago
- TextRank implementation for Python 3.☆1,265Updated 2 years ago
- Python library for interactive topic model visualization. Port of the R LDAvis package.☆1,841Updated last year
- sentence embedding by Smooth Inverse Frequency weighting scheme☆1,087Updated 6 years ago
- Simple web service providing a word embedding model☆1,443Updated 2 years ago
- Semantic Text Similarity Dataset Hub☆721Updated 7 years ago
- 🦆 Contextually-keyed word vectors☆1,661Updated 5 months ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,151Updated last year
- GSDMM: Short text clustering☆357Updated 2 years ago