piskvorky / gensim-data
Data repository for pretrained NLP models and NLP corpora.
☆1,021 Updated 7 years ago
Alternatives and similar repositories for gensim-data
Users who are interested in gensim-data are comparing it to the libraries listed below.
- Sentence embedding by Smooth Inverse Frequency weighting scheme ☆1,088 Updated 5 years ago
- Pre-trained ELMo Representations for Many Languages ☆1,460 Updated 4 years ago
- A Python tool for evaluating the quality of sentence embeddings. ☆2,108 Updated last year
- Tensorflow implementation of contextualized word representations from bi-directional language models ☆1,615 Updated 2 years ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy ☆1,387 Updated last week
- A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.) ☆1,139 Updated 9 months ago
- 🔡 Token level embeddings from BERT model on mxnet and gluonnlp ☆452 Updated 5 years ago
- ☆1,308 Updated 2 years ago
- Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/ ☆1,254 Updated 3 years ago
- General purpose unsupervised sentence representations ☆1,204 Updated 2 years ago
- Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks" ☆2,189 Updated 2 years ago
- This repository recorded my NLP journey. ☆1,078 Updated 4 years ago
- ✨ Fast Coreference Resolution in spaCy with Neural Networks ☆2,879 Updated 2 years ago
- Super easy library for BERT based NLP models ☆1,898 Updated 9 months ago
- Python scripts for training/testing paragraph vectors ☆650 Updated 3 months ago
- Python interface to CoreNLP using a bidirectional server-client interface. ☆521 Updated 3 years ago
- A curated list of resources dedicated to text summarization ☆1,543 Updated 2 years ago
- Text Classification Algorithms: A Survey ☆1,811 Updated 2 months ago
- Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results. ☆1,710 Updated 2 years ago
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character … ☆1,896 Updated 2 years ago
- A collection of public and free annotated datasets of relationships between entities/nominals (Portuguese and English) ☆694 Updated 3 years ago
- Code to obtain the CNN / Daily Mail dataset (non-anonymized) for summarization ☆648 Updated 2 years ago
- Data augmentation for NLP, presented at EMNLP 2019 ☆1,638 Updated 2 years ago
- GSDMM: Short text clustering ☆355 Updated 2 years ago
- Text Similarity ☆400 Updated 5 years ago
- Topic modeling with latent Dirichlet allocation using Gibbs sampling ☆1,278 Updated 10 months ago
- Package for evaluating word embeddings ☆436 Updated 4 years ago
- A framework to learn cross-lingual word embedding mappings ☆649 Updated 2 years ago
- Pytorch-Named-Entity-Recognition-with-BERT ☆1,240 Updated 4 years ago
- A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of lang… ☆1,538 Updated last week