piskvorky / gensim-dataLinks
Data repository for pretrained NLP models and NLP corpora.
☆1,020Updated 7 years ago
Alternatives and similar repositories for gensim-data
Users that are interested in gensim-data are comparing it to the libraries listed below
Sorting:
- A python tool for evaluating the quality of sentence embeddings.☆2,106Updated last year
- ☆1,309Updated 2 years ago
- InferSent sentence embeddings☆2,284Updated 3 years ago
- General purpose unsupervised sentence representations☆1,204Updated 2 years ago
- Pre-trained ELMo Representations for Many Languages☆1,460Updated 4 years ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,389Updated last month
- sentence embedding by Smooth Inverse Frequency weighting scheme☆1,088Updated 5 years ago
- ✨Fast Coreference Resolution in spaCy with Neural Networks☆2,879Updated 2 years ago
- A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of lang…☆1,541Updated 2 weeks ago
- PyTorch deep learning models for document classification☆595Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,212Updated 8 months ago
- Python scripts for training/testing paragraph vectors☆650Updated 3 months ago
- Python wrapper for Stanford CoreNLP.☆920Updated 3 years ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,141Updated 10 months ago
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,614Updated 2 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,912Updated 2 years ago
- 🦆 Contextually-keyed word vectors☆1,654Updated 2 months ago
- Package for evaluating word embeddings☆436Updated 4 years ago
- Compute Sentence Embeddings Fast!☆623Updated 2 years ago
- This repository recorded my NLP journey.☆1,078Updated 4 years ago
- Python Keyphrase Extraction module☆1,580Updated last year
- Code to obtain the CNN / Daily Mail dataset (non-anonymized) for summarization☆649Updated 3 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆734Updated 10 months ago
- Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/☆1,256Updated 3 years ago
- Text Classification Algorithms: A Survey☆1,813Updated 2 months ago
- Basic Utilities for PyTorch Natural Language Processing (NLP)☆2,219Updated last year
- Overview of Modern Deep Learning Techniques Applied to Natural Language Processing☆1,329Updated 5 years ago
- TextRank implementation for Python 3.☆1,259Updated 2 years ago
- A curated list of resources dedicated to text summarization☆1,543Updated 2 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati…☆670Updated 3 weeks ago