wkh923 / m3pc
M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025
☆19 Updated 10 months ago
Alternatives and similar repositories for m3pc
Users who are interested in m3pc are comparing it to the repositories listed below.
- Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models ☆24 Updated 4 months ago
- ☆58 Updated last year
- ☆48 Updated last year
- ☆50 Updated last year
- official implementation of QVPO ☆60 Updated 2 weeks ago
- Official repo for Offline RL for Online RL ☆19 Updated 2 years ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su… ☆33 Updated 4 months ago
- [ICML 2025] The Official Implementation of "Efficient Robotic Policy Learning via Latent Space Backward Planning" ☆30 Updated last month
- This is the source code of FUSION, a safety-aware causal representation for generalizable driving agents. ☆26 Updated last year
- ☆35 Updated last year
- [NeurIPS'24] The Official PyTorch implementation of DRAIL ☆53 Updated last year
- Code for Compositional Diffusion-Based Continuous Constraint Solvers (CoRL 23) ☆67 Updated 2 years ago
- [RA-L/ICRA2025] Official implementation for paper "Diverse Controllable Diffusion Policy with Signal Temporal Logic." ☆34 Updated last year
- ☆13 Updated last month
- ☆50 Updated 4 months ago
- Q-learning with Adjoint Matching ☆44 Updated last week
- Interactive Fleet Learning Benchmark ☆37 Updated 2 years ago
- [ICML2025] Official implementation of Efficient Online Reinforcement Learning for Diffusion Policies appearing in ICML 2025. ☆49 Updated 2 weeks ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations ☆36 Updated 2 weeks ago
- Evaluation of TD-MPC2. ☆21 Updated 2 years ago
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value ☆35 Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆27 Updated 2 years ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024) ☆12 Updated last year
- ☆19 Updated 11 months ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024) ☆59 Updated last year
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4" ☆76 Updated last year
- PWM: Policy Learning with Large World Models ☆65 Updated 6 months ago
- Implementation of SAC and TD3 based on various RNN and Transformer. ☆28 Updated last year
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization ☆82 Updated 2 years ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024 ☆73 Updated last year