THUDM/Self-Contrast

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/THUDM/Self-Contrast)

THUDM / Self-Contrast

Extensive Self-Contrast Enables Feedback-Free Language Model Alignment

☆20

Alternatives and similar repositories for Self-Contrast

Users that are interested in Self-Contrast are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Hritikbansal / jpo
View on GitHub
☆13Jul 2, 2025Updated last year
songxxzp / OpenReviewers
View on GitHub
Openreviewers: Multi Agent Academic Review Simulation System
☆24Mar 2, 2024Updated 2 years ago
thu-cs-lab / webhookd
View on GitHub
A simple gitlab/github web hooks daemon
☆16May 15, 2026Updated 2 months ago
facebookresearch / RLCD
View on GitHub
Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment
☆70Aug 18, 2023Updated 2 years ago
dzp2095 / grabClass
View on GitHub
武汉理工大学抢课/一键评教 pyqt
☆13Jul 5, 2018Updated 8 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Cornell-RL / drpo
View on GitHub
Dateset Reset Policy Optimization
☆30Apr 12, 2024Updated 2 years ago
StanfordDataCompressionClass / notes
View on GitHub
☆13Apr 12, 2026Updated 3 months ago
XiaojuanTang / Mars
View on GitHub
a benchmark to evaluate the situated inductive reasoning
☆16Jan 7, 2025Updated last year
hyoseok1223 / Product-of-Experts-GAN
View on GitHub
PyTorch unoffical implementation of "PoE-GAN : Multimodal Conditional Image Synthesis with Product-of-Experts GANs"
☆15Mar 29, 2023Updated 3 years ago
LeeChanHyuk / Weighted-Boxes-Fusion-implementation
View on GitHub
Weighted-Boxes-Fusion method implementation with YOLOv4 and YOLOv5
☆11Jul 14, 2022Updated 4 years ago
THUDM / NaturalCodeBench
View on GitHub
NaturalCodeBench (Findings of ACL 2024)
☆70Oct 14, 2024Updated last year
tsinghua-fib-lab / SmartAgent
View on GitHub
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
☆27Aug 20, 2025Updated 11 months ago
yinyueqin / relative-preference-optimization
View on GitHub
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
☆26Feb 23, 2024Updated 2 years ago
thu-coai / SPaR
View on GitHub
☆47Jun 11, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
zx-pan / mdm
View on GitHub
Codebase for the paper Masked Diffusion as Self-Supervised Representation Learner
☆15Apr 12, 2024Updated 2 years ago
SeanLeng1 / Reward-Calibration
View on GitHub
☆21Dec 14, 2024Updated last year
Rainier-rq / verl-if
View on GitHub
Official implementation of the paper "Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following"
☆40Jan 11, 2026Updated 6 months ago
zerolllin / Delta-L-Normalization
View on GitHub
☆16Oct 11, 2025Updated 9 months ago
abmfy / wordle
View on GitHub
A Wordle game written in Rust, refined. Play in browser with the power of WebAssembly! Course project of Programming Training, Tsinghua U…
☆16Jul 10, 2024Updated 2 years ago
RikkiXu / NCD_PC
View on GitHub
Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation (ECCV2024)
☆14Nov 1, 2024Updated last year
VITA-Group / TAPE
View on GitHub
[ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…
☆15Jun 6, 2025Updated last year
jina-ai / mteb-de
View on GitHub
MTEB: Massive Text Embedding Benchmark
☆11Jan 29, 2024Updated 2 years ago
gauss5930 / AlpaGasus2-QLoRA
View on GitHub
This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!
☆15Nov 22, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yanivle / fast_minbpe
View on GitHub
☆18Feb 6, 2025Updated last year
JoHof / IntegratedGradientsTutorial
View on GitHub
Very concise example of integrated gradients (a method to reveal areas of attention in input images)
☆10Jun 17, 2019Updated 7 years ago
limenlp / safer-instruct
View on GitHub
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Feb 22, 2024Updated 2 years ago
Azure-stars / arceos
View on GitHub
An experimental modular OS written in Rust.
☆17Feb 11, 2025Updated last year
hbin0701 / Self-Explore
View on GitHub
[𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…
☆52May 4, 2024Updated 2 years ago
EsYoon7 / RLHF-TLCR
View on GitHub
[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
☆12Dec 6, 2024Updated last year
gabrielcassimiro17 / async-langchain
View on GitHub
Demonstration of how to run multiple chains in Langchain Assyncronously
☆12Jul 6, 2023Updated 3 years ago
austrian-code-wizard / c3po
View on GitHub
☆30Apr 6, 2026Updated 3 months ago
THUDM / SciGLM
View on GitHub
SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)
☆88Feb 25, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Nix07 / finetuning
View on GitHub
This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…
☆32Oct 27, 2025Updated 8 months ago
ArmelRandy / tree-of-problems
View on GitHub
[EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality
☆20Mar 4, 2025Updated last year
cat-state / clip_benchmark
View on GitHub
clip retrieval benchmark
☆17May 4, 2022Updated 4 years ago
dinobby / MAgICoRE
View on GitHub
☆23Sep 19, 2024Updated last year
F2-Song / ICDPO
View on GitHub
The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…
☆16Feb 15, 2024Updated 2 years ago
yang-zhang / labse-pytorch
View on GitHub
Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Model
☆21Sep 2, 2020Updated 5 years ago
FanbinLu / STEVE-R1
View on GitHub
R1-like Computer-use Agent
☆91Mar 21, 2025Updated last year