Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
☆41Mar 26, 2024Updated last year
Alternatives and similar repositories for Clean-Offline-RLHF
Users that are interested in Clean-Offline-RLHF are comparing it to the libraries listed below
Sorting:
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Nov 20, 2024Updated last year
- ☆32Mar 10, 2024Updated last year
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆33Dec 7, 2024Updated last year
- ☆10Jun 27, 2024Updated last year
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- ☆12Mar 15, 2022Updated 3 years ago
- Multi-objective reinforcement learning for covid-19 control☆12Aug 12, 2021Updated 4 years ago
- ☆17Oct 12, 2023Updated 2 years ago
- Repository replicating the design- and behaviour-adaptation algorithm using reinforcement learning algorithm presented in the paper " Dat…☆27Jul 20, 2022Updated 3 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- Transformer-based World Models☆89Apr 4, 2023Updated 2 years ago
- [ACM MM 2022] Target-Driven Structured Transformer Planner for Vision-Language Navigation☆17Nov 1, 2022Updated 3 years ago
- ☆16Oct 7, 2025Updated 4 months ago
- Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022☆14Dec 15, 2022Updated 3 years ago
- Corax: Core RL in JAX☆39Feb 22, 2024Updated 2 years ago
- A ray-based library of Distributed POPulation-based OPtimization for Large-Scale Black-Box Optimization.☆18Feb 23, 2024Updated 2 years ago
- ☆15Jan 18, 2026Updated last month
- Template Catkin package for ROS-1 Noetic; Contains basic structure for creating rospy nodes☆17Oct 21, 2022Updated 3 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆15Jan 18, 2022Updated 4 years ago
- ☆14May 31, 2022Updated 3 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Oct 27, 2020Updated 5 years ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆41Feb 27, 2024Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆16Mar 3, 2023Updated 3 years ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆17Feb 14, 2024Updated 2 years ago
- Code for the paper Task Agnostic Morphology Evolution.☆20May 25, 2021Updated 4 years ago
- ☆74Feb 4, 2024Updated 2 years ago
- Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation"☆19Jul 11, 2023Updated 2 years ago
- GPT implementation in Flax☆18Jan 8, 2022Updated 4 years ago
- Federated learning is a distributed learning method that trains a deep network on user devices without collecting data from central serve…☆14Jul 7, 2020Updated 5 years ago
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆25May 29, 2025Updated 9 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Oct 15, 2023Updated 2 years ago
- Code for Contrastive Preference Learning (CPL)☆179Nov 22, 2024Updated last year
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239☆23Jun 24, 2023Updated 2 years ago
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"☆22Apr 26, 2023Updated 2 years ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year