hkust-nlp/SynCSE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hkust-nlp/SynCSE)

hkust-nlp / SynCSE

This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"

☆40

Alternatives and similar repositories for SynCSE

Users that are interested in SynCSE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hkust-nlp / GUIMid
View on GitHub
☆22May 3, 2025Updated last year
BinWang28 / RSE
View on GitHub
Paper: Relational Sentence Embedding for Flexible Semantic Matching
☆12May 22, 2024Updated 2 years ago
perceptiveshawty / RankCSE
View on GitHub
Implementation of "RankCSE: Unsupervised Sentence Representation Learning via Learning to Rank" (ACL 2023)
☆49Mar 12, 2024Updated 2 years ago
eth-lre / book2dial
View on GitHub
Generating Teacher Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots, ACL 2024 Findings
☆14Mar 27, 2025Updated last year
kanekomasahiro / eb-gec
View on GitHub
☆15Mar 15, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
djz233 / ClusterNS
View on GitHub
Finding of ACL2023: Clustering-Aware Negative Sampling for Unsupervised Sentence Representation
☆13Oct 16, 2023Updated 2 years ago
ekinakyurek / compgen
View on GitHub
Paper: Learning to Recombine and Resample Data for Compositional Generalization
☆11Oct 9, 2020Updated 5 years ago
BDBC-KG-NLP / MixCSE_AAAI2022
View on GitHub
Code for AAAI 2022 paper Unsupervised Sentence Representation via Contrastive Learning with Mixing Negatives
☆23Jun 14, 2022Updated 4 years ago
LeeSureman / E5-Retrieval-Reproduction
View on GitHub
Use contrastive learning to train a large language model (LLM) as a retriever
☆12Jul 19, 2024Updated 2 years ago
dengyang17 / LLM-Proactive
View on GitHub
☆15Nov 23, 2023Updated 2 years ago
ma787639046 / bowdpr
View on GitHub
[SIGIR24] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval
☆18Feb 29, 2024Updated 2 years ago
tqfang / comet-deepspeed
View on GitHub
Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.
☆14Jan 23, 2022Updated 4 years ago
chakki-works / entitypedia
View on GitHub
Entitypedia is an Extended Named Entity Dictionary from Wikipedia.
☆13Dec 7, 2022Updated 3 years ago
BKHMSI / cultural-trends
View on GitHub
Investigating Cultural Alignment of Large Language Models
☆13Aug 14, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
izhx / uni-rep
View on GitHub
Code for embedding and retrieval research.
☆16Oct 24, 2023Updated 2 years ago
NorskRegnesentral / NeuralTextSanitizer
View on GitHub
Neural models for detecting and masking personal information from texts
☆16Nov 25, 2022Updated 3 years ago
lemon0830 / promptCSE
View on GitHub
code for promptCSE, emnlp 2022
☆11Apr 10, 2023Updated 3 years ago
PlusLabNLP / Com2Sense
View on GitHub
Dataset & Code for Com2Sense Benchmark
☆13Sep 8, 2021Updated 4 years ago
Tomiinek / Aargh
View on GitHub
☆12Jan 2, 2024Updated 2 years ago
turboLJY / Transfer-Prompts-for-Text-Generation
View on GitHub
☆16Aug 14, 2022Updated 3 years ago
LouChao98 / nner_as_parsing
View on GitHub
☆16Mar 22, 2023Updated 3 years ago
mudabek / encoding-cxr-report-gen
View on GitHub
On the Importance of Image Encoding in Automated Chest X-Ray Report Generation, BMVC 2022
☆16Dec 22, 2022Updated 3 years ago
worldbank / GISTEmbed
View on GitHub
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings
☆45Mar 6, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
HKUST-KnowComp / SubeventWriter
View on GitHub
Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere…
☆11Oct 16, 2022Updated 3 years ago
andrejmiscic / simcls-pytorch
View on GitHub
PyTorch reimplementation of the paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization"
☆16Oct 17, 2021Updated 4 years ago
ashleve / autoroot
View on GitHub
An experimental package for python project root setup with just one import
☆15Jun 22, 2026Updated last month
shehel / BERT_propaganda_detection
View on GitHub
Propaganda detection using fine-tuned BERT
☆20Jul 21, 2022Updated 4 years ago
masora1030 / eigoyurusan
View on GitHub
To be readable without enhancing english power.
☆10Jul 22, 2020Updated 6 years ago
EhimeNLP / AcademicRoBERTa
View on GitHub
☆10Sep 3, 2024Updated last year
KUIS-AI / Arabic-ALBERT
View on GitHub
Arabic edition of ALBERT pretrained language models
☆15Apr 25, 2021Updated 5 years ago
lipiji / uChecker
View on GitHub
Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"
☆19Aug 17, 2022Updated 3 years ago
hkust-nlp / deepsearch-tts
View on GitHub
Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification
☆21Oct 8, 2025Updated 9 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
yuzhaouoe / pretraining-data-packing
View on GitHub
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆24Aug 18, 2024Updated last year
microsoft / EfficientLongSequenceModeling
View on GitHub
☆54Jan 19, 2023Updated 3 years ago
yeonsw / RankEncoder
View on GitHub
☆35May 18, 2023Updated 3 years ago
hkust-nlp / RL-Verifier-Robustness
View on GitHub
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
☆24Oct 7, 2025Updated 9 months ago
Shentao-YANG / Preference_Grounded_Guidance
View on GitHub
Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).
☆17Jan 8, 2025Updated last year
shihono / evaluate_japanese_w2v
View on GitHub
script to evaluate pre-trained Japanese word2vec model on Japanese similarity dataset
☆12Nov 4, 2024Updated last year
masdevid / sentistrength_id
View on GitHub
Sentiment Strength Detection in Bahasa Indonesia
☆41Mar 23, 2017Updated 9 years ago