piskvorky / gensim-dataLinks
Data repository for pretrained NLP models and NLP corpora.
☆1,025Updated 7 years ago
Alternatives and similar repositories for gensim-data
Users that are interested in gensim-data are comparing it to the libraries listed below
Sorting:
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati…☆670Updated last month
- A python tool for evaluating the quality of sentence embeddings.☆2,107Updated last year
- General purpose unsupervised sentence representations☆1,204Updated 2 years ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,390Updated last month
- Super easy library for BERT based NLP models☆1,897Updated 11 months ago
- semi supervised guided topic model with custom guidedLDA☆510Updated 3 months ago
- Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx☆634Updated 4 years ago
- Python Keyphrase Extraction module☆1,580Updated 2 years ago
- sentence embedding by Smooth Inverse Frequency weighting scheme☆1,087Updated 5 years ago
- InferSent sentence embeddings☆2,282Updated 3 years ago
- ☆1,311Updated 3 years ago
- A curated list of resources dedicated to text summarization☆1,542Updated 2 years ago
- word2vec Google News model☆524Updated 5 years ago
- A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of lang…☆1,543Updated last month
- Pre-trained ELMo Representations for Many Languages☆1,462Updated 4 years ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,144Updated 10 months ago
- Python scripts for training/testing paragraph vectors☆650Updated 4 months ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,215Updated 9 months ago
- Overview of Modern Deep Learning Techniques Applied to Natural Language Processing☆1,329Updated 5 years ago
- PyTorch deep learning models for document classification☆593Updated last year
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,233Updated 5 months ago
- Repository with all what is necessary for sentiment analysis and related areas☆540Updated last year
- Semantic Text Similarity Dataset Hub☆718Updated 7 years ago
- TextRank implementation for Python 3.☆1,259Updated 2 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆594Updated 11 months ago
- Topic Modeling in Embedding Spaces☆557Updated last year
- ✨Fast Coreference Resolution in spaCy with Neural Networks☆2,882Updated 2 years ago
- This repository recorded my NLP journey.☆1,078Updated 4 years ago
- 🔡 Token level embeddings from BERT model on mxnet and gluonnlp☆451Updated 5 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆736Updated 11 months ago