Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
☆42Nov 20, 2024Updated last year
Alternatives and similar repositories for Uni-RLHF-Platform
Users that are interested in Uni-RLHF-Platform are comparing it to the libraries listed below
Sorting:
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Mar 26, 2024Updated last year
- Reinforcement learning library for PyTorch.☆11Jun 15, 2018Updated 7 years ago
- Variational Autoencoder with non-euclidean (hyperbolic) latent space☆12Nov 25, 2022Updated 3 years ago
- Learning High-Quality and General-Purpose Phrase Representations. Findings of EACL 2024☆16Feb 29, 2024Updated 2 years ago
- ☆37Apr 27, 2023Updated 2 years ago
- ☆16Oct 7, 2025Updated 4 months ago
- Corax: Core RL in JAX☆40Feb 22, 2024Updated 2 years ago
- Template Catkin package for ROS-1 Noetic; Contains basic structure for creating rospy nodes☆17Oct 21, 2022Updated 3 years ago
- Safe Reinforcement Learning with Natural Language Constraints☆15Oct 24, 2021Updated 4 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆19Mar 10, 2021Updated 4 years ago
- PRML Page-by-page配套资料,对PRML全书及各章节的review☆17Apr 16, 2024Updated last year
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 3 years ago
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020☆16Jun 22, 2022Updated 3 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- ☆74Feb 4, 2024Updated 2 years ago
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆25May 29, 2025Updated 9 months ago
- GPT implementation in Flax☆18Jan 8, 2022Updated 4 years ago
- Episodic Control☆22Sep 20, 2022Updated 3 years ago
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆19Jan 11, 2023Updated 3 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Dec 30, 2022Updated 3 years ago
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.☆20Oct 19, 2020Updated 5 years ago
- ☆47Dec 11, 2023Updated 2 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆51Dec 8, 2022Updated 3 years ago
- Turn from Google research,A simple code to realize HDR plus☆16Jul 15, 2019Updated 6 years ago
- ☆23Aug 9, 2022Updated 3 years ago
- This is an easy to understand, simplified, broken-down implementation of Diffusion Models written in PyTorch. The architecture is borrowe…☆27Aug 18, 2023Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 3 years ago
- A set of environments utilizing pybullet for simulation of robotic manipulation tasks.☆29Mar 8, 2021Updated 4 years ago
- Process Simulations Meet AI. Supercharge Your Process Engineering. Generate Infinite Data, Train Advanced Models, and Revolutionise Indus…☆11Oct 8, 2024Updated last year
- ☆26Feb 6, 2022Updated 4 years ago
- Masked World Models for Visual Control☆135Jun 11, 2023Updated 2 years ago
- Code for Automatic Curriculum Learning through Value Disagreement☆31Jun 15, 2020Updated 5 years ago
- ☆34May 27, 2023Updated 2 years ago
- Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)☆35Oct 15, 2024Updated last year
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆36Jan 24, 2026Updated last month
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆83Jul 27, 2022Updated 3 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆80Nov 19, 2022Updated 3 years ago
- 🕹 Pikachu-volleyball game-based multi-agent RL environment using PettingZoo☆11Sep 29, 2024Updated last year
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆40Jul 13, 2024Updated last year