Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆34Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for hidden-context
Users that are interested in hidden-context are comparing it to the libraries listed below.
- ☆21Dec 17, 2020Updated 5 years ago
- ☆16Apr 12, 2023Updated 3 years ago
- ☆48Mar 25, 2025Updated last year
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆29Jul 11, 2024Updated last year
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Jul 27, 2021Updated 4 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago
- ☆160Nov 23, 2024Updated last year
- Baselines for Neural MMO -- new users should treat this repo as a starter project☆51Jul 29, 2024Updated last year
- PyTorch implementation of "The Option Keyboard: Combining Skills in Reinforcement Learning" (NeurIPS 2019)☆12Jul 2, 2020Updated 5 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- Code to reproduce the empirical results in the research paper☆38Oct 12, 2021Updated 4 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆51Jun 26, 2024Updated last year
- ☆10Mar 13, 2023Updated 3 years ago
- [NeurIPS 2023] Official implementation of "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations"☆10Feb 19, 2024Updated 2 years ago
- This repo supports automatic line plots for multi-seed event files from TensorBoard☆12Jun 23, 2022Updated 3 years ago
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)☆22Aug 1, 2021Updated 4 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging☆118Oct 23, 2023Updated 2 years ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆96Aug 20, 2024Updated last year
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- ☆13Jul 2, 2025Updated 9 months ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Nov 15, 2018Updated 7 years ago
- ReasoningShield: Safety Detection over Reasoning Traces of Large Reasoning Models☆26Sep 27, 2025Updated 6 months ago
- [ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!☆39Aug 2, 2024Updated last year
- ☆23Jun 13, 2024Updated last year
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆27Jun 20, 2019Updated 6 years ago
- K* search based implementation of top-k and top-quality planners☆19Apr 1, 2026Updated 2 weeks ago
- [ICANN 2022] ''An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection'' Official Code☆10Feb 27, 2024Updated 2 years ago
- Unity course final project☆10May 21, 2023Updated 2 years ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆79Jun 10, 2025Updated 10 months ago
- ☆14May 1, 2023Updated 2 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- Dataset Reset Policy Optimization☆31Apr 12, 2024Updated 2 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxi…☆67Oct 18, 2021Updated 4 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Sep 1, 2022Updated 3 years ago
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Dec 19, 2022Updated 3 years ago
- ☆15Dec 31, 2020Updated 5 years ago