princeton-nlp/SimCSE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/princeton-nlp/SimCSE)

princeton-nlp / SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

☆3,655

Alternatives and similar repositories for SimCSE

Users that are interested in SimCSE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bojone / SimCSE
View on GitHub
SimCSE在中文任务上的简单实验
☆605Aug 7, 2023Updated 2 years ago
yym6472 / ConSERT
View on GitHub
Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
☆542Dec 10, 2021Updated 4 years ago
huggingface / sentence-transformers
View on GitHub
State-of-the-Art Embeddings, Retrieval, and Reranking
☆18,936Updated this week
ZhuiyiTechnology / simbert
View on GitHub
a bert for retrieval and generation
☆860Feb 26, 2021Updated 5 years ago
ymcui / Chinese-BERT-wwm
View on GitHub
Pre-Training with Whole Word Masking for Chinese BERT（中文BERT-wwm系列模型）
☆10,224Apr 19, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
dropreg / R-Drop
View on GitHub
☆880May 24, 2024Updated 2 years ago
huawei-noah / Pretrained-Language-Model
View on GitHub
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
☆3,162Jan 22, 2024Updated 2 years ago
dbiir / UER-py
View on GitHub
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
☆3,110May 9, 2024Updated 2 years ago
princeton-nlp / LM-BFF
View on GitHub
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
☆727Aug 29, 2022Updated 3 years ago
vdogmcgee / SimCSE-Chinese-Pytorch
View on GitHub
SimCSE在中文上的复现，有监督+无监督
☆281Feb 21, 2025Updated last year
voidism / DiffCSE
View on GitHub
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
☆297Jul 12, 2026Updated last week
CLUEbenchmark / CLUE
View on GitHub
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
☆4,273Feb 6, 2026Updated 5 months ago
thunlp / PromptPapers
View on GitHub
Must-read papers on prompt-based tuning for pre-trained language models.
☆4,321Jul 17, 2023Updated 3 years ago
airaria / TextBrewer
View on GitHub
A PyTorch-based knowledge distillation toolkit for natural language processing
☆1,705May 8, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
bohanli / BERT-flow
View on GitHub
TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)
☆535May 19, 2021Updated 5 years ago
thunlp / OpenPrompt
View on GitHub
An Open-Source Framework for Prompt-Learning.
☆4,886Jul 16, 2024Updated 2 years ago
facebookresearch / DPR
View on GitHub
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
☆1,869Apr 6, 2023Updated 3 years ago
kongds / Prompt-BERT
View on GitHub
PromptBERT: Improving BERT Sentence Embeddings with Prompts
☆341Nov 22, 2023Updated 2 years ago
bojone / BERT-whitening
View on GitHub
简单的向量白化改善句向量质量
☆486Jun 17, 2021Updated 5 years ago
brightmart / roberta_zh
View on GitHub
RoBERTa中文预训练模型: RoBERTa for Chinese
☆2,793Jul 22, 2024Updated 2 years ago
bojone / bert4keras
View on GitHub
keras implement of transformers for humans
☆5,418Nov 11, 2024Updated last year
facebookresearch / fairseq
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,251Sep 30, 2025Updated 9 months ago
makcedward / nlpaug
View on GitHub
Data augmentation for NLP
☆4,663Updated this week
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
namisan / mt-dnn
View on GitHub
Multi-Task Deep Neural Networks for Natural Language Understanding
☆2,259Mar 7, 2024Updated 2 years ago
autoliuweijie / BERT-whitening-pytorch
View on GitHub
Pytorch version of BERT-whitening
☆308Oct 9, 2021Updated 4 years ago
zhengyanzhao1997 / NLP-model
View on GitHub
☆278Apr 14, 2026Updated 3 months ago
microsoft / unilm
View on GitHub
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆22,168Jan 23, 2026Updated 6 months ago
ymcui / Chinese-ELECTRA
View on GitHub
Pre-trained Chinese ELECTRA（中文ELECTRA预训练模型）
☆1,433Apr 19, 2026Updated 3 months ago
princeton-nlp / DensePhrases
View on GitHub
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…
☆607Jun 15, 2022Updated 4 years ago
brightmart / albert_zh
View on GitHub
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
☆3,980Nov 21, 2022Updated 3 years ago
ZhuiyiTechnology / pretrained-models
View on GitHub
Open Language Pre-trained Model Zoo
☆1,003Nov 18, 2021Updated 4 years ago
xinyi-code / SimCSE-Pytorch
View on GitHub
中文数据集下SimCSE+ESimCSE的实现
☆190May 21, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
google-research / text-to-text-transfer-transformer
View on GitHub
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,536Jul 8, 2026Updated 2 weeks ago
ShannonAI / mrc-for-flat-nested-ner
View on GitHub
Code for ACL 2020 paper `A Unified MRC Framework for Named Entity Recognition`
☆678Jun 12, 2023Updated 3 years ago
loujie0822 / DeepIE
View on GitHub
DeepIE: Deep Learning for Information Extraction
☆1,937Dec 9, 2022Updated 3 years ago
facebookresearch / SentEval
View on GitHub
A python tool for evaluating the quality of sentence embeddings.
☆2,110Mar 19, 2024Updated 2 years ago
THUDM / P-tuning
View on GitHub
A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.
☆938Oct 6, 2022Updated 3 years ago
google-research / electra
View on GitHub
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆2,367Mar 23, 2024Updated 2 years ago
alibaba / AliceMind
View on GitHub
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
☆2,042Mar 19, 2024Updated 2 years ago