friolero/self_aligned_reward_learning

[ICML 2024] Learning Reward for Robot Skills Using Large Language Models via Self-Alignment

☆19

Alternatives and similar repositories for self_aligned_reward_learning

Users that are interested in self_aligned_reward_learning are comparing it to the libraries listed below.

labicon / CurricuLLM
Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models
☆28 · Sep 26, 2025 · Updated 9 months ago
Suhan-Ling / Coarse-to-fine_Affordance
☆16 · Oct 10, 2024 · Updated last year
elena-ecn / quadrotor_mpc
Model Predictive Control of a quadrotor for trajectory tracking.
☆13 · May 8, 2023 · Updated 3 years ago
Tom0Brien / tinympc
A lightweight implementation of MPC and NMPC in C++ using Eigen3
☆11 · Oct 27, 2023 · Updated 2 years ago
hegdepashupati / gaussian-process-odes
Implementation of the work Variational multiple shooting for Bayesian ODEs with Gaussian processes
☆13 · Aug 5, 2022 · Updated 3 years ago
hridaybavle / altitude_filtering
An implementation of an EKF based multi-sensor fusion algorithm, used for accurate flight altitude estimation of UAVs. Fusing the IMU, la…
☆12 · Sep 10, 2017 · Updated 8 years ago
xlang-ai / text2reward
[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
☆210 · Dec 17, 2024 · Updated last year
catezi / MAPT
This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…
☆12 · Apr 9, 2026 · Updated 3 months ago
Guozheng-Ma / Adaptive-Replay-Ratio
[ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.
☆13 · Oct 9, 2024 · Updated last year
chwoong / LiRE
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)
☆18 · Jun 18, 2024 · Updated 2 years ago
SgtVincent / EMOS
The project repository for paper EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents: https://arxiv.org/abs…
☆76 · Jan 6, 2025 · Updated last year
xycheng / DCFNet
☆12 · Oct 27, 2018 · Updated 7 years ago
yijiangh / coop_assembly
Geometry generation/planning for robotically assembled spatial structures
☆14 · Mar 23, 2023 · Updated 3 years ago
davidireland-iso / LeNSE
☆14 · Nov 26, 2022 · Updated 3 years ago
HauffQian / DGAP
☆14 · May 13, 2025 · Updated last year
brandleyzhou / SUB-Depth
[BMVC 2022] "SUB-Depth: Self-distillation and Uncertainty Boosting Self-supervised Monocular Depth Estimation"
☆16 · Apr 19, 2022 · Updated 4 years ago
spatialdatasciencegroup / HST
[NeurIPS '23] Official code of "A Hierarchical Spatial Transformer for Massive Point Samples in Continuous Space"
☆14 · Jul 13, 2025 · Updated last year
Sentient-Beings / Behavior-Trees
Learn to use Behavior Trees with a simple example
☆16 · Jun 29, 2025 · Updated last year
twankim / avod_ssn
Deep Sensor Fusion for Single Source Robustness
☆12 · Feb 5, 2026 · Updated 5 months ago
BarSGuy / Subgraphormer
Subgraphormer: Unifying Subgraph GNNs and Graph Transformers via Graph Products (ICML 2024)
☆11 · Jul 13, 2024 · Updated 2 years ago
GeWu-Lab / Action-Preference-Optimization
☆16 · Oct 26, 2025 · Updated 8 months ago
Li-ChangHao / CoNav
☆12 · Jul 16, 2024 · Updated 2 years ago
GrigoryBartosh / sde_matching
☆20 · Apr 19, 2026 · Updated 3 months ago
PPjmchen / vlmpc
☆80 · Apr 8, 2025 · Updated last year
MinhZou / UniG-Encoder
UniG-Encoder: A Universal Feature Encoder for Graph and Hypergraph Node Classification.
☆14 · Jul 18, 2025 · Updated last year
mangdangroboticsclub / mini_pupper_2_bsp
BSP (Board Support Package) for Mini Pupper 2
☆12 · Jun 3, 2026 · Updated last month
dsbrown1331 / bayesianrex
☆21 · Dec 17, 2020 · Updated 5 years ago
BeingBeyond / Being-H0
Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos (ICML 2026)
☆51 · May 4, 2026 · Updated 2 months ago
ai-ar-research / Lemur-program-verification
A verifier that integrates LLMs into automated C program verification
☆15 · Apr 4, 2026 · Updated 3 months ago
yufeiwang63 / RL-VLM-F
Code for Reinforcement Learning from Vision Language Foundation Model Feedback
☆140 · May 22, 2024 · Updated 2 years ago
Yiminghh / VertexEntanglement
☆17 · Apr 14, 2024 · Updated 2 years ago
fuyw / FuRL
☆25 · Aug 19, 2024 · Updated last year
PinkWink / ros2_basic
☆19 · Apr 19, 2024 · Updated 2 years ago
epfl-lasa / cpr_load_support
This package is going to provide a controller for the ClearPath mobile robot to approach a load (carried by the human) and support it an…
☆10 · Nov 8, 2017 · Updated 8 years ago
sharinka0715 / FlowDreamer
[RA-L 2026] Official implementation of the paper "FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipu…
☆19 · Jan 19, 2026 · Updated 6 months ago
ademiadeniji / lamp
☆47 · Jan 29, 2024 · Updated 2 years ago
zowiezhang / STAS
The code for paper 'STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning'
☆17 · Oct 6, 2024 · Updated last year
danielshin1 / oprl
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20 · Dec 30, 2022 · Updated 3 years ago
dannysdeng / dqn-pytorch
PyTorch - Implicit Quantile Networks - Quantile Regression - C51
☆22 · Jul 26, 2019 · Updated 6 years ago