piskvorky / gensim-data
Data repository for pretrained NLP models and NLP corpora.
☆1,003 · Updated 6 years ago
Alternatives and similar repositories for gensim-data:
Users who are interested in gensim-data are comparing it to the repositories listed below:
- General-purpose unsupervised sentence representations ☆1,200 · Updated 2 years ago
- A Python tool for evaluating the quality of sentence embeddings. ☆2,094 · Updated 11 months ago
- Overview of Modern Deep Learning Techniques Applied to Natural Language Processing ☆1,333 · Updated 4 years ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy ☆1,365 · Updated 3 weeks ago
- Sentence embedding by the Smooth Inverse Frequency weighting scheme ☆1,085 · Updated 5 years ago
- Simple web service providing a word embedding model ☆1,437 · Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) ☆1,200 · Updated 5 months ago
- A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.) ☆1,113 · Updated 6 months ago
- InferSent sentence embeddings ☆2,286 · Updated 3 years ago
- A fast, efficient universal vector embedding utility package. ☆1,642 · Updated last year
- A curated list of resources dedicated to text summarization ☆1,541 · Updated 2 years ago
- Text Similarity ☆403 · Updated 4 years ago
- A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of lang… ☆1,526 · Updated 3 months ago
- Super easy library for BERT based NLP models ☆1,886 · Updated 6 months ago
- Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/ ☆1,253 · Updated 3 years ago
- Python Keyphrase Extraction module ☆1,576 · Updated last year
- A curated list of pretrained sentence and word embedding models ☆2,249 · Updated 3 years ago
- ✨Fast Coreference Resolution in spaCy with Neural Networks ☆2,865 · Updated last year
- Python scripts for training/testing paragraph vectors ☆648 · Updated last year
- TensorFlow implementation of contextualized word representations from bi-directional language models ☆1,619 · Updated 2 years ago
- Calculates Word Mover's Distance Insanely Fast ☆460 · Updated last year
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati… ☆666 · Updated last year
- Python wrapper for Stanford CoreNLP. ☆921 · Updated 3 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy ☆731 · Updated 6 months ago
- A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… ☆1,219 · Updated last month
- 🦆 Contextually-keyed word vectors ☆1,640 · Updated 11 months ago
- Semi-supervised guided topic model with custom guidedLDA ☆503 · Updated 4 years ago
- GSDMM: Short text clustering ☆355 · Updated 2 years ago
- TextRank implementation for Python 3. ☆1,253 · Updated last year
- This repository records my NLP journey. ☆1,077 · Updated 4 years ago