YangRui2015/RiC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YangRui2015/RiC)

YangRui2015 / RiC

Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"

☆78

Alternatives and similar repositories for RiC

Users that are interested in RiC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

srzer / MOD
View on GitHub
Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".
☆30Oct 30, 2024Updated last year
alexrame / rewardedsoups
View on GitHub
Rewarded soups official implementation
☆64Sep 27, 2023Updated 2 years ago
princeton-nlp / unintentional-unalignment
View on GitHub
[ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
☆32Jan 7, 2026Updated 6 months ago
SteveKGYang / MetaAligner
View on GitHub
Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models
☆24Sep 26, 2024Updated last year
junkangwu / Dr_DPO
View on GitHub
[ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"
☆19Jun 1, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OpenBMB / CPO
View on GitHub
☆29Jul 16, 2024Updated 2 years ago
RLHFlow / Directional-Preference-Alignment
View on GitHub
Directional Preference Alignment
☆62Sep 23, 2024Updated last year
WKUAILAB / AI_Tutorial
View on GitHub
Wenzhou-Kean University AI-LAB
☆10Jun 6, 2022Updated 4 years ago
git-disl / Vaccine
View on GitHub
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
☆51Jan 15, 2026Updated 6 months ago
MaHuanAAA / g_fair_prompting
View on GitHub
☆37Oct 14, 2023Updated 2 years ago
cassidylaidlaw / hidden-context
View on GitHub
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆35Dec 14, 2023Updated 2 years ago
tengxiao1 / SimPER
View on GitHub
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)
☆17Aug 22, 2025Updated 11 months ago
zankner / CLoud
View on GitHub
Critique-out-Loud Reward Models
☆76Oct 18, 2024Updated last year
ByteDance-Seed / DATAMASK
View on GitHub
Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning
☆21Jan 4, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yunke-wang / gail_atari
View on GitHub
PyTorch Implementation of Visual GAIL in Atari Games
☆14Dec 7, 2022Updated 3 years ago
zowiezhang / Amulet
View on GitHub
Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"
☆20Mar 18, 2025Updated last year
allenai / reward-bench
View on GitHub
RewardBench: the first evaluation tool for reward models.
☆727Feb 16, 2026Updated 5 months ago
cxcscmu / MATES
View on GitHub
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
☆80Nov 14, 2024Updated last year
SLIT-AI / WRPO
View on GitHub
[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion
☆14Mar 17, 2025Updated last year
NLie2 / what_features_jailbreak_LLMs
View on GitHub
☆18Mar 30, 2025Updated last year
Lslland / T-Vaccine
View on GitHub
☆19Jun 21, 2025Updated last year
wizardlancet / diagnosis_zero
View on GitHub
diagnosis_zero, R1 Zero reproduce on disease diagnosis
☆32Jul 24, 2025Updated last year
shenao-zhang / SELM
View on GitHub
The official implementation of Self-Exploring Language Models (SELM)
☆63Jun 4, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ChenDRAG / CEP-energy-guided-diffusion
View on GitHub
Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction （ICML 2023）
☆55Aug 26, 2023Updated 2 years ago
convei-lab / BotsTalk
View on GitHub
🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…
☆16Oct 7, 2024Updated last year
maxidl / MMD-critic
View on GitHub
A PyTorch based implementation of MMD-critic
☆16Nov 12, 2020Updated 5 years ago
HumanCompatibleAI / atari-irl
View on GitHub
☆28Mar 13, 2019Updated 7 years ago
MouseHu / GEM
View on GitHub
☆16Jul 1, 2021Updated 5 years ago
EnnengYang / RepresentationSurgery
View on GitHub
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆49Oct 10, 2024Updated last year
umd-huang-lab / VLM-Poisoning
View on GitHub
Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"
☆61Jan 15, 2025Updated last year
liziniu / GEM
View on GitHub
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)
☆58May 12, 2025Updated last year
allenai / FineGrainedRLHF
View on GitHub
☆283Jan 6, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
allenai / natural-instructions-v1
View on GitHub
Benchmarking Generalization to New Tasks from Natural Language Instructions
☆28Jul 2, 2021Updated 5 years ago
joeljang / FLM
View on GitHub
All-in-one repository for Fine-tuning & Pretraining (Large) Language Models
☆15Mar 8, 2023Updated 3 years ago
LiuAmber / RAHF
View on GitHub
[ACL 2024 main] Aligning Large Language Models with Human Preferences through Representation Engineering (https://aclanthology.org/2024.…
☆28Sep 25, 2024Updated last year
YangRui2015 / AWGCSL
View on GitHub
Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
☆27Feb 21, 2022Updated 4 years ago
cmavro / PackLLM
View on GitHub
Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization
☆15Apr 25, 2024Updated 2 years ago
namezhenzhang / CM2-RLCR-Tool-Agent
View on GitHub
Official Implementation of Papar CM2
☆25Apr 21, 2026Updated 3 months ago
ewsheng / controllable-nlg-biases
View on GitHub
Framework for controlling demographic biases in NLG (using adversarial prompts)
☆21Jun 12, 2023Updated 3 years ago