piskvorky / gensim-data
Data repository for pretrained NLP models and NLP corpora.
β1,012Updated 7 years ago
Alternatives and similar repositories for gensim-data:
Users that are interested in gensim-data are comparing it to the libraries listed below
- General purpose unsupervised sentence representationsβ1,201Updated 2 years ago
- A python tool for evaluating the quality of sentence embeddings.β2,101Updated last year
- πΈ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCyβ1,376Updated last month
- β¨Fast Coreference Resolution in spaCy with Neural Networksβ2,870Updated last year
- Super easy library for BERT based NLP modelsβ1,890Updated 7 months ago
- Python Keyphrase Extraction moduleβ1,581Updated last year
- β1,297Updated 2 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizatiβ¦β666Updated last year
- A curated list of pretrained sentence and word embedding modelsβ2,255Updated 3 years ago
- π₯ Use the latest Stanza (StanfordNLP) research models directly in spaCyβ733Updated 7 months ago
- PyTorch deep learning models for document classificationβ595Updated last year
- sentence embedding by Smooth Inverse Frequency weighting schemeβ1,086Updated 5 years ago
- Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/β1,253Updated 3 years ago
- Single-document unsupervised keyword extractionβ1,699Updated 3 weeks ago
- Python scripts for training/testing paragraph vectorsβ650Updated 3 weeks ago
- jiant is an nlp toolkitβ1,664Updated last year
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)β1,123Updated 7 months ago
- InferSent sentence embeddingsβ2,283Updated 3 years ago
- Pre-trained ELMo Representations for Many Languagesβ1,461Updated 3 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)β1,207Updated 6 months ago
- Overview of Modern Deep Learning Techniques Applied to Natural Language Processingβ1,334Updated 5 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language modelsβ1,619Updated 2 years ago
- Python library for interactive topic model visualization. Port of the R LDAvis package.β1,823Updated 8 months ago
- Compute Sentence Embeddings Fast!β622Updated 2 years ago
- Top2Vec learns jointly embedded topic, document and word vectors.β3,021Updated 4 months ago
- A collection of notebooks for Natural Language Processing from NLP Townβ988Updated 8 months ago
- π¦ Contextually-keyed word vectorsβ1,645Updated last year
- Github repo with tutorials to fine tune transformers for diff NLP tasksβ844Updated last year
- PyTorch original implementation of Cross-lingual Language Model Pretraining.β2,905Updated 2 years ago
- Simple web service providing a word embedding modelβ1,439Updated last year