UKPLab / sentence-transformers

State-of-the-Art Text Embeddings

☆17,710

Alternatives and similar repositories for sentence-transformers

Users who are interested in sentence-transformers are comparing it to the libraries listed below.

jessevig / bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
☆7,710 Updated 4 months ago
google / sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
☆11,371 Updated 3 weeks ago
jina-ai / clip-as-service
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
☆12,759 Updated last year
MaartenGr / KeyBERT
Minimal keyword extraction with BERT
☆4,023 Updated 3 months ago
MaartenGr / BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
☆7,129 Updated this week
google-research / text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,442 Updated 5 months ago
ThilinaRajapakse / simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…
☆4,217 Updated 2 months ago
doccano / doccano
Open source annotation tool for machine learning practitioners.
☆10,342 Updated 4 months ago
huggingface / tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
☆10,179 Updated last week
facebookresearch / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆31,875 Updated 3 weeks ago
flairNLP / flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
☆14,306 Updated 2 months ago
huggingface / transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…
☆151,291 Updated last week
google-research / bert
TensorFlow code and pre-trained models for BERT
☆39,596 Updated last year
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆19,900 Updated this week
princeton-nlp / SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
☆3,606 Updated last year
makcedward / nlpaug
Data augmentation for NLP
☆4,621 Updated last year
facebookresearch / faiss
A library for efficient similarity search and clustering of dense vectors.
☆37,626 Updated this week
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆20,151 Updated this week
huggingface / accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…
☆9,211 Updated last week
microsoft / unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆21,787 Updated 3 months ago
stanfordnlp / stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
☆7,634 Updated this week
huggingface / trl
Train transformer language models with reinforcement learning.
☆15,934 Updated last week
allenai / allennlp
An open-source NLP research library, built on PyTorch.
☆11,881 Updated 2 years ago
harvardnlp / annotated-transformer
An annotated implementation of the Transformer paper.
☆6,629 Updated last year
Lightning-AI / pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
☆30,299 Updated this week
ddangelov / Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
☆3,090 Updated 11 months ago
codertimo / BERT-pytorch
Google AI 2018 BERT pytorch implementation
☆6,485 Updated 2 years ago
piskvorky / gensim
Topic Modelling for Humans
☆16,233 Updated last week
NVIDIA / Megatron-LM
Ongoing research training transformer models at scale
☆13,906 Updated this week
zihangdai / xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆6,181 Updated 2 years ago