CJReinforce/RIME_ICML2024

Official code for the ICML 2024 Spotlight paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences"

☆35

Alternatives and similar repositories for RIME_ICML2024

Users interested in RIME_ICML2024 are comparing it to the repositories listed below.

CJReinforce / JOWA
Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"
☆28 · Dec 1, 2024 · Updated last year
chwoong / LiRE
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)
☆17 · Jun 18, 2024 · Updated last year
huxiao09 / QPA
☆13 · Sep 24, 2024 · Updated last year
CASIA-IVA-Lab / SC-Tune
Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"
☆16 · Apr 22, 2024 · Updated last year
catezi / MAPT
This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…
☆11 · Feb 6, 2025 · Updated last year
Weixy21 / ABNet
Adaptive explicit-Barrier Net for Safe and Scalable Robot Learning
☆15 · May 16, 2025 · Updated 9 months ago
USC-Lira / language-preference-learning
☆13 · Feb 21, 2025 · Updated last year
rll-research / rune
Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
☆15 · May 26, 2022 · Updated 3 years ago
CJReinforce / PURE
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
☆158 · Oct 23, 2025 · Updated 4 months ago
mahaozhe / ReLara
[ICML 2024] The algorithm of Reinforcement Learning with an Assistant Reward Agent (ReLara)
☆16 · Aug 2, 2024 · Updated last year
snu-mllab / DPPO
Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
☆42 · Jul 20, 2024 · Updated last year
LinFunster / PP-TIL
[IROS 2024] PP-TIL: Personalized Planning for Autonomous Driving with Instance-based Transfer Imitation Learning
☆24 · Feb 28, 2025 · Updated last year
typoverflow / WiseRL
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21 · Mar 24, 2025 · Updated 11 months ago
danielshin1 / oprl
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20 · Dec 30, 2022 · Updated 3 years ago
mengyuest / pSTL-diffusion-policy
[RA-L/ICRA2025] Official implementation for paper "Diverse Controllable Diffusion Policy with Signal Temporal Logic."
☆34 · Oct 17, 2024 · Updated last year
Shengqiang-Zhang / LoHo-Ravens
Official code for the long-horizon language-conditioned robotic manipulation benchmark LoHoRavens.
☆22 · Oct 8, 2024 · Updated last year
MayankD409 / RL_MPC
This project implements a reinforcement learning (RL) agent integrated with a Model Predictive Controller (MPC) for autonomous lane chang…
☆40 · Jan 14, 2025 · Updated last year
Improbable-AI / random-latent-exploration
☆28 · Aug 19, 2024 · Updated last year
declare-lab / nora-1.5
NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards
☆93 · Jan 11, 2026 · Updated last month
jacky121298 / WLST
[ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection
☆12 · Feb 6, 2024 · Updated 2 years ago
jhejna / few-shot-preference-rl
☆37 · Apr 27, 2023 · Updated 2 years ago
kedarrajpathak / bimanual_teleoperation
ROS2 packages for dual arm setup of Kinova robot and control using MoveIt Servo and ArUco pose estimation
☆10 · Jul 27, 2025 · Updated 7 months ago
SkyLineHXY / Piper_Mujoco_Sim
Traditional grasping pipeline using Yolo-world + Sam + Graspnet, implemented with a Songling (AgileX) robotic arm in the Mujoco environment
☆15 · Sep 15, 2025 · Updated 5 months ago
ImprintLab / SPA
SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation (ICCV 2025)
☆14 · Sep 26, 2025 · Updated 5 months ago
darshmenon / UR3_ROS2_PICK_AND_PLACE
UR Robotic Arm with Robotiq 2-Finger Gripper for ROS2
☆22 · Feb 27, 2026 · Updated last week
hjy-u / ETOG
[ICRA 2025] A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping
☆11 · Feb 7, 2025 · Updated last year
LucasG2001 / cartesian_impedance_control
ROS2 cartesian_impedance_controller from PdZ
☆11 · Oct 22, 2025 · Updated 4 months ago
hk-zh / language-conditioned-robot-manipulation-models
https://arxiv.org/abs/2312.10807
☆78 · Dec 29, 2025 · Updated 2 months ago
jhejna / inverse-preference-learning
☆43 · May 25, 2023 · Updated 2 years ago
MarcDcls / mjlab_upkie
This repository contains Reinforcement Learning (RL) environments for the Upkie robot.
☆23 · Feb 23, 2026 · Updated last week
FrankJIE09 / Peg_in_Hole_with_OnRobot_Force_Sensor-UR_Robot
☆13 · Jun 8, 2024 · Updated last year
0nandon / SEAL
[ICLR 2026] Official code of "Segment any Events with Language"
☆35 · Feb 7, 2026 · Updated last month
Chenan-W / Python-Trajectory-Tracking-Control-for-UAV
Physical (real-hardware) experiment of a single UAV tracking a spiral trajectory
☆10 · May 22, 2023 · Updated 2 years ago
vint-1 / dreamsmooth
DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)
☆12 · May 6, 2024 · Updated last year
UTNuclearRoboticsPublic / closed-chain-affordance
Joint trajectory planning for constrained manipulation using the Closed-Chain Affordance framework by Janak Panthi
☆11 · Jan 19, 2026 · Updated last month
hzm8341 / vla_tutorial
how to learn vla
☆13 · Jan 15, 2025 · Updated last year
Improbable-AI / orso
☆16 · Feb 22, 2025 · Updated last year
holken / polite
code for polite
☆11 · Feb 28, 2024 · Updated 2 years ago
DripNowhy / Sherlock
[NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"
☆28 · Sep 18, 2025 · Updated 5 months ago