pickxiguapi/Uni-RLHF-Platform

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pickxiguapi/Uni-RLHF-Platform)

pickxiguapi / Uni-RLHF-Platform

Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

☆42

Alternatives and similar repositories for Uni-RLHF-Platform

Users that are interested in Uni-RLHF-Platform are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pickxiguapi / Clean-Offline-RLHF
View on GitHub
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …
☆42Mar 26, 2024Updated 2 years ago
ZibinDong / AlignDiff-ICLR2024
View on GitHub
☆33Mar 10, 2024Updated 2 years ago
jhejna / few-shot-preference-rl
View on GitHub
☆38Apr 27, 2023Updated 3 years ago
danielshin1 / oprl
View on GitHub
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20Dec 30, 2022Updated 3 years ago
YuejiangLIU / prioritized_option_critic
View on GitHub
Implementation of the Prioritized Option-Critic on the Four-Rooms Environment
☆17Dec 24, 2017Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
daniel-robotics / ros_python_pkg
View on GitHub
Template Catkin package for ROS-1 Noetic; Contains basic structure for creating rospy nodes
☆17Oct 21, 2022Updated 3 years ago
ethanluoyc / corax
View on GitHub
Corax: Core RL in JAX
☆41Feb 22, 2024Updated 2 years ago
QData / dmc_remastered
View on GitHub
A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.
☆20Oct 19, 2020Updated 5 years ago
brentyi / transformer-exercises-jax
View on GitHub
☆18Apr 17, 2026Updated 3 months ago
ByteDance-Seed / TaskMem
View on GitHub
☆26Jun 2, 2026Updated last month
IanYangChina / SI4RP-data
View on GitHub
☆17Updated this week
FuxiRL / DunkCityDynasty
View on GitHub
☆74Feb 4, 2024Updated 2 years ago
ReinholdM / Papers-of-Offline-RL
View on GitHub
Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
☆18Apr 21, 2022Updated 4 years ago
Bluedotdot2021 / PRML-book_review
View on GitHub
PRML Page-by-page配套资料，对PRML全书及各章节的review
☆17Apr 16, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ZibinDong / cocos
View on GitHub
Official implementation of the paper "Conditioning Matters: Training Diffusion Policies is Faster Than You Think".
☆18May 19, 2025Updated last year
holken / polite
View on GitHub
code for polite
☆12Feb 28, 2024Updated 2 years ago
JayYang168 / FJSP
View on GitHub
遗传算法求解柔性车间调度问题
☆13Jun 3, 2023Updated 3 years ago
CJReinforce / RIME_ICML2024
View on GitHub
Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)
☆36Oct 15, 2024Updated last year
jimimvp / torch_rl
View on GitHub
Reinforcement learning library for PyTorch.
☆11Jun 15, 2018Updated 8 years ago
apple / ml-reed
View on GitHub
☆13Feb 5, 2024Updated 2 years ago
julian-8897 / hyperbolic-latent-vae
View on GitHub
Variational Autoencoder with non-euclidean (hyperbolic) latent space
☆14Nov 25, 2022Updated 3 years ago
csmile-1006 / ARP
View on GitHub
Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)
☆33Sep 25, 2023Updated 2 years ago
YivanZhang / lio
View on GitHub
Learning from Indirect Observations
☆11Jul 16, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
rll-research / BPref
View on GitHub
Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.
☆136Nov 3, 2021Updated 4 years ago
younggyoseo / MWM
View on GitHub
Masked World Models for Visual Control
☆138Jun 11, 2023Updated 3 years ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
nissymori / remax-rl
View on GitHub
[ICML2026] Official JAX code for Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
☆15Jul 3, 2026Updated 3 weeks ago
SDS-Lab / QW_Loss
View on GitHub
A Quasi-Wasserstein Loss for Learning Graph Neural Networks (QW loss)
☆10May 20, 2024Updated 2 years ago
dsbrown1331 / CoRL2019-DREX
View on GitHub
Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…
☆51Dec 8, 2022Updated 3 years ago
Jasonxu1225 / Improved-Lightweight-YOLOv5-for-Face-Mask-Detection
View on GitHub
[ICANN 2022] ''An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection'' Official Code
☆10Feb 27, 2024Updated 2 years ago
TrentBrick / RewardConditionedUDRL
View on GitHub
Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies
☆19Mar 10, 2021Updated 5 years ago
ymetz / rlhfblender
View on GitHub
RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback
☆14May 19, 2026Updated 2 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
apayani / ILP
View on GitHub
☆10Nov 27, 2019Updated 6 years ago
facebookresearch / hsd3
View on GitHub
Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines
☆52Jun 3, 2022Updated 4 years ago
ShengxiLi / rcf_gan
View on GitHub
☆13Jul 6, 2023Updated 3 years ago
nissymori / JAX-CORL
View on GitHub
Clean single-file implementation of offline RL algorithms in JAX
☆182Jun 5, 2026Updated last month
Jasonxu1225 / Awesome-Constraint-Inference-in-RL
View on GitHub
[TMLR 2025] A collection of research papers on constraint inference within the field of RL
☆11May 9, 2025Updated last year
yufeiwang63 / ROLL
View on GitHub
Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020
☆16Jun 22, 2022Updated 4 years ago
automl / CARL
View on GitHub
Benchmarking RL generalization in an interpretable way.
☆183Nov 20, 2025Updated 8 months ago