Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"
☆79 · Jun 10, 2025 · Updated 8 months ago
Alternatives and similar repositories for RiC
Users who are interested in RiC are comparing it to the repositories listed below
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives". ☆29 · Oct 30, 2024 · Updated last year
- Rewarded soups official implementation ☆62 · Sep 27, 2023 · Updated 2 years ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization ☆96 · Aug 20, 2024 · Updated last year
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization ☆32 · Jan 7, 2026 · Updated 2 months ago
- Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs" ☆16 · Mar 18, 2025 · Updated 11 months ago
- Wenzhou-Kean University AI-LAB ☆10 · Jun 6, 2022 · Updated 3 years ago
- This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024) ☆49 · Jan 15, 2026 · Updated last month
- 🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset… ☆16 · Oct 7, 2024 · Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models ☆15 · Mar 8, 2023 · Updated 2 years ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ☆50 · Oct 23, 2024 · Updated last year
- code of IJCAI submission "Soft Hindsight Experience Replay" ☆13 · Mar 23, 2020 · Updated 5 years ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion ☆14 · Mar 17, 2025 · Updated 11 months ago
- ☆28 · Mar 13, 2019 · Updated 6 years ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models) ☆52 · May 12, 2025 · Updated 9 months ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆33 · Dec 14, 2023 · Updated 2 years ago
- ☆35 · Oct 14, 2023 · Updated 2 years ago
- A PyTorch reimplementation of the most widely circulated TORCS experiment, originally written in Keras and TensorFlow. Trains a DDPG model. ☆12 · Dec 23, 2018 · Updated 7 years ago
- Directional Preference Alignment ☆58 · Sep 23, 2024 · Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents ☆49 · Updated this week
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025) ☆17 · Aug 22, 2025 · Updated 6 months ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024] ☆79 · Nov 14, 2024 · Updated last year
- ☆19 · Jun 21, 2025 · Updated 8 months ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models ☆24 · Sep 26, 2024 · Updated last year
- Source code for Stable Hadamard Memory ☆24 · May 6, 2025 · Updated 10 months ago
- ☆18 · May 10, 2024 · Updated last year
- Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation" ☆19 · Jul 11, 2023 · Updated 2 years ago
- The code in "SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design" ☆42 · Oct 20, 2025 · Updated 4 months ago
- RewardBench: the first evaluation tool for reward models. ☆697 · Feb 16, 2026 · Updated 2 weeks ago
- ☆20 · Jun 14, 2022 · Updated 3 years ago
- Framework for controlling demographic biases in NLG (using adversarial prompts) ☆20 · Jun 12, 2023 · Updated 2 years ago
- [WACV 2024] Domain Generalisation via Risk Distribution Matching ☆22 · Sep 19, 2024 · Updated last year
- Representation Surgery for Multi-Task Model Merging. ICML, 2024. ☆47 · Oct 10, 2024 · Updated last year
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023) ☆50 · Aug 26, 2023 · Updated 2 years ago
- Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models" ☆59 · Jan 15, 2025 · Updated last year
- Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules. ☆17 · Jun 23, 2021 · Updated 4 years ago
- Code and data for reproducing baselines for TopiOCQA, an open-domain conversational question-answering dataset ☆56 · Nov 15, 2023 · Updated 2 years ago
- ☆25 · Aug 21, 2024 · Updated last year
- ☆31 · Jan 7, 2023 · Updated 3 years ago
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197) ☆39 · Sep 8, 2025 · Updated 5 months ago