Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
☆41Mar 26, 2024Updated last year
Alternatives and similar repositories for Clean-Offline-RLHF
Users that are interested in Clean-Offline-RLHF are comparing it to the libraries listed below
Sorting:
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Nov 20, 2024Updated last year
- ☆32Mar 10, 2024Updated 2 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- ☆18Mar 10, 2026Updated last week
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆16Feb 14, 2024Updated 2 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Corax: Core RL in JAX☆41Feb 22, 2024Updated 2 years ago
- Academic Personal Homepage of Jian Tang☆15Mar 9, 2026Updated 2 weeks ago
- ☆10Jun 27, 2024Updated last year
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- ☆18Jul 10, 2022Updated 3 years ago
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.☆20Oct 19, 2020Updated 5 years ago
- ☆53Nov 10, 2022Updated 3 years ago
- ☆15Jan 18, 2026Updated 2 months ago
- ☆13Jun 29, 2023Updated 2 years ago
- ☆13Oct 16, 2025Updated 5 months ago
- Template Catkin package for ROS-1 Noetic; Contains basic structure for creating rospy nodes☆17Oct 21, 2022Updated 3 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated 11 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Jul 20, 2024Updated last year
- ☆16Mar 10, 2026Updated last week
- ☆14May 31, 2022Updated 3 years ago
- Transformer-based World Models☆89Apr 4, 2023Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Oct 15, 2023Updated 2 years ago
- ☆17Oct 12, 2023Updated 2 years ago
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239☆22Jun 24, 2023Updated 2 years ago
- ☆12Mar 15, 2022Updated 4 years ago
- ☆74Feb 4, 2024Updated 2 years ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆143May 10, 2023Updated 2 years ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆41Feb 27, 2024Updated 2 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Oct 27, 2020Updated 5 years ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- Repository replicating the design- and behaviour-adaptation algorithm using reinforcement learning algorithm presented in the paper " Dat…☆27Jul 20, 2022Updated 3 years ago
- Masked World Models for Visual Control☆135Jun 11, 2023Updated 2 years ago
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement☆10Feb 19, 2024Updated 2 years ago
- ☆27Apr 22, 2024Updated last year
- Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)☆29Aug 19, 2023Updated 2 years ago