piskvorky / gensim-dataLinks
Data repository for pretrained NLP models and NLP corpora.
β1,041Updated 7 years ago
Alternatives and similar repositories for gensim-data
Users that are interested in gensim-data are comparing it to the libraries listed below
Sorting:
- General purpose unsupervised sentence representationsβ1,208Updated 3 years ago
- πΈ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCyβ1,403Updated 2 months ago
- Python Keyphrase Extraction moduleβ1,587Updated 2 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizatiβ¦β673Updated 7 months ago
- Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorExβ640Updated 4 years ago
- Python scripts for training/testing paragraph vectorsβ651Updated 4 months ago
- sentence embedding by Smooth Inverse Frequency weighting schemeβ1,087Updated 6 years ago
- β1,315Updated 3 years ago
- A python tool for evaluating the quality of sentence embeddings.β2,108Updated last year
- TextRank implementation for Python 3.β1,268Updated 2 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherβ¦β1,256Updated 5 months ago
- Super easy library for BERT based NLP modelsβ1,915Updated last year
- Easy to use extractive text summarization with BERTβ1,454Updated 2 years ago
- InferSent sentence embeddingsβ2,279Updated 4 years ago
- Simple web service providing a word embedding modelβ1,445Updated 2 years ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)β1,172Updated last year
- A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of langβ¦β1,557Updated 7 months ago
- Semantic Text Similarity Dataset Hubβ725Updated 7 years ago
- A collection of notebooks for Natural Language Processing from NLP Townβ1,013Updated last year
- word2vec Google News modelβ529Updated 6 years ago
- semi supervised guided topic model with custom guidedLDAβ512Updated 8 months ago
- Single-document unsupervised keyword extractionβ1,811Updated last month
- A curated list of resources dedicated to text summarizationβ1,540Updated 3 years ago
- π‘ Token level embeddings from BERT model on mxnet and gluonnlpβ451Updated 6 years ago
- Code for paper Fine-tune BERT for Extractive Summarizationβ1,505Updated 4 years ago
- Text Similarityβ399Updated 5 years ago
- This repository recorded my NLP journey.β1,083Updated 5 years ago
- π₯ Use the latest Stanza (StanfordNLP) research models directly in spaCyβ744Updated last year
- Topic Modeling in Embedding Spacesβ560Updated 2 years ago
- Compute Sentence Embeddings Fast!β624Updated 2 years ago