piskvorky / gensim-data
Data repository for pretrained NLP models and NLP corpora.
☆1,046 Updated 7 years ago
Alternatives and similar repositories for gensim-data
Users who are interested in gensim-data are comparing it to the libraries listed below.
- General purpose unsupervised sentence representations ☆1,208 Updated 3 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati… ☆673 Updated 8 months ago
- Super easy library for BERT based NLP models ☆1,916 Updated last year
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy ☆1,402 Updated 2 months ago
- A python tool for evaluating the quality of sentence embeddings. ☆2,107 Updated last year
- Python Keyphrase Extraction module ☆1,586 Updated 2 years ago
- TextRank implementation for Python 3. ☆1,269 Updated 2 years ago
- ✨Fast Coreference Resolution in spaCy with Neural Networks ☆2,891 Updated 2 years ago
- PyTorch deep learning models for document classification ☆596 Updated 2 years ago
- semi supervised guided topic model with custom guidedLDA ☆513 Updated 9 months ago
- A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of lang… ☆1,559 Updated 7 months ago
- Python scripts for training/testing paragraph vectors ☆652 Updated 4 months ago
- A Python framework for sequence labeling evaluation (named-entity recognition, pos tagging, etc...) ☆1,174 Updated last year
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… ☆1,259 Updated 6 months ago
- InferSent sentence embeddings ☆2,278 Updated 4 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy ☆744 Updated last year
- ☆1,317 Updated 3 years ago
- word2vec Google News model ☆530 Updated 6 years ago
- sentence embedding by Smooth Inverse Frequency weighting scheme ☆1,086 Updated 6 years ago
- jiant is an nlp toolkit ☆1,675 Updated 2 years ago
- Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx ☆640 Updated 4 years ago
- Simple web service providing a word embedding model ☆1,445 Updated 2 years ago
- The guide to tackle with the Text Summarization ☆1,313 Updated 3 years ago
- NLP, before and after spaCy ☆2,232 Updated 2 years ago
- 🦆 Contextually-keyed word vectors ☆1,670 Updated 9 months ago
- Compute Sentence Embeddings Fast! ☆624 Updated 2 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) ☆1,221 Updated last year
- Python library for interactive topic model visualization. Port of the R LDAvis package. ☆1,846 Updated last month
- Text Similarity ☆399 Updated 5 years ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies. ☆747 Updated 3 years ago