Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
☆42Mar 26, 2024Updated 2 years ago
Alternatives and similar repositories for Clean-Offline-RLHF
Users that are interested in Clean-Offline-RLHF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Nov 20, 2024Updated last year
- ☆33Mar 10, 2024Updated 2 years ago
- ☆18Apr 17, 2026Updated last month
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆16Feb 14, 2024Updated 2 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆34Dec 7, 2024Updated last year
- ☆10Jun 27, 2024Updated last year
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- ☆18Jul 10, 2022Updated 3 years ago
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.☆20Oct 19, 2020Updated 5 years ago
- ☆16Apr 14, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆54Nov 10, 2022Updated 3 years ago
- ☆14Apr 25, 2026Updated last month
- Template Catkin package for ROS-1 Noetic; Contains basic structure for creating rospy nodes☆17Oct 21, 2022Updated 3 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated last year
- Code for Contrastive Preference Learning (CPL)☆182Nov 22, 2024Updated last year
- ☆17Mar 10, 2026Updated 3 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆43Jul 20, 2024Updated last year
- ☆14May 31, 2022Updated 4 years ago
- Transformer-based World Models☆90Apr 4, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆57May 21, 2023Updated 3 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆168Oct 15, 2023Updated 2 years ago
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239☆22Jun 24, 2023Updated 2 years ago
- ☆17Oct 12, 2023Updated 2 years ago
- ☆12Mar 15, 2022Updated 4 years ago
- ☆73Feb 4, 2024Updated 2 years ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆145May 10, 2023Updated 3 years ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆41Feb 27, 2024Updated 2 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Oct 27, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Repository replicating the design- and behaviour-adaptation algorithm using reinforcement learning algorithm presented in the paper " Dat…☆27Jul 20, 2022Updated 3 years ago
- Masked World Models for Visual Control☆137Jun 11, 2023Updated 3 years ago
- SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation (ICCV 2025)☆16Sep 26, 2025Updated 8 months ago
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement☆12Feb 19, 2024Updated 2 years ago
- ☆27Apr 22, 2024Updated 2 years ago
- Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)☆28Aug 19, 2023Updated 2 years ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year