piskvorky / gensim-data
Data repository for pretrained NLP models and NLP corpora.
☆997 Updated 6 years ago
Alternatives and similar repositories for gensim-data:
Users who are interested in gensim-data are comparing it to the libraries listed below.
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati… ☆664 Updated 11 months ago
- General purpose unsupervised sentence representations ☆1,199 Updated 2 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language models ☆1,619 Updated last year
- Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/ ☆1,254 Updated 2 years ago
- InferSent sentence embeddings ☆2,285 Updated 3 years ago
- ✨Fast Coreference Resolution in spaCy with Neural Networks ☆2,864 Updated last year
- Python scripts for training/testing paragraph vectors ☆647 Updated last year
- A python tool for evaluating the quality of sentence embeddings. ☆2,091 Updated 10 months ago
- sentence embedding by Smooth Inverse Frequency weighting scheme ☆1,085 Updated 5 years ago
- Simple web service providing a word embedding model ☆1,437 Updated last year
- Automatically exported from code.google.com/p/word2vec ☆1,529 Updated last year
- Overview of Modern Deep Learning Techniques Applied to Natural Language Processing ☆1,331 Updated 4 years ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy ☆1,359 Updated 2 weeks ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) ☆1,193 Updated 3 months ago
- Python wrapper for Stanford CoreNLP. ☆923 Updated 3 years ago
- Super easy library for BERT based NLP models ☆1,878 Updated 5 months ago
- semi supervised guided topic model with custom guidedLDA ☆502 Updated 4 years ago
- Python Keyphrase Extraction module ☆1,572 Updated last year
- A curated list of pretrained sentence and word embedding models ☆2,241 Updated 3 years ago
- PyTorch deep learning models for document classification ☆593 Updated last year
- Code for paper Fine-tune BERT for Extractive Summarization ☆1,473 Updated 3 years ago
- Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx ☆628 Updated 3 years ago
- Text Classification Algorithms: A Survey ☆1,811 Updated 3 months ago
- A framework to learn cross-lingual word embedding mappings ☆647 Updated last year
- Pre-trained ELMo Representations for Many Languages ☆1,462 Updated 3 years ago
- Python interface to CoreNLP using a bidirectional server-client interface. ☆518 Updated 3 years ago
- ☆1,296 Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,345 Updated 10 months ago
- Package for evaluating word embeddings ☆436 Updated 4 years ago
- Stanford Open Information Extraction made simple! ☆647 Updated last year