piskvorky / gensim-data
Data repository for pretrained NLP models and NLP corpora.
☆989Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for gensim-data
- sentence embedding by Smooth Inverse Frequency weighting scheme☆1,083Updated 5 years ago
- General purpose unsupervised sentence representations☆1,193Updated 2 years ago
- A python tool for evaluating the quality of sentence embeddings.☆2,087Updated 8 months ago
- Super easy library for BERT based NLP models☆1,866Updated 3 months ago
- InferSent sentence embeddings☆2,280Updated 3 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,187Updated last month
- Pre-trained ELMo Representations for Many Languages☆1,463Updated 3 years ago
- Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/☆1,253Updated 2 years ago
- A curated list of resources dedicated to text summarization☆1,535Updated last year
- Simple web service providing a word embedding model☆1,433Updated last year
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,097Updated 2 months ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati…☆661Updated 8 months ago
- A curated list of pretrained sentence and word embedding models☆2,233Updated 3 years ago
- Python Keyphrase Extraction module☆1,565Updated last year
- word2vec Google News model☆516Updated 4 years ago
- jiant is an nlp toolkit☆1,647Updated last year
- Semantic Text Similarity Dataset Hub☆715Updated 6 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,620Updated last year
- ✨Fast Coreference Resolution in spaCy with Neural Networks☆2,857Updated last year
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,197Updated 3 months ago
- NLP, before and after spaCy☆2,217Updated last year
- A framework to learn cross-lingual word embedding mappings☆645Updated last year
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,352Updated 5 months ago
- Python scripts for training/testing paragraph vectors☆645Updated last year
- A library for Multilingual Unsupervised or Supervised word Embeddings☆3,190Updated 2 years ago
- ☆1,295Updated 2 years ago
- semi supervised guided topic model with custom guidedLDA☆499Updated 4 years ago
- Pytorch-Named-Entity-Recognition-with-BERT☆1,211Updated 3 years ago
- Python library for interactive topic model visualization. Port of the R LDAvis package.☆1,806Updated 4 months ago
- Single-document unsupervised keyword extraction☆1,648Updated 10 months ago