wkh923 / m3pcLinks
M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025
☆17Updated 6 months ago
Alternatives and similar repositories for m3pc
Users that are interested in m3pc are comparing it to the libraries listed below
Sorting:
- [ICML'2023 Oral] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆64Updated last year
- Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models☆19Updated last week
- Official repo for Offline RL for Online RL☆18Updated last year
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆53Updated 8 months ago
- ☆54Updated 8 months ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆24Updated 11 months ago
- Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models☆20Updated last year
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆30Updated 4 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆68Updated last year
- ☆30Updated last year
- ☆45Updated last week
- ☆44Updated last year
- ☆47Updated last year
- This is the source code of FUSION, a safety-aware causal representation for generalizable driving agents.☆21Updated 11 months ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆29Updated last year
- official implementation of QVPO☆48Updated 11 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- Framework to transform natural language into formal language (Temporal Logics).☆32Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆27Updated 2 years ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆11Updated last year
- ☆12Updated 7 months ago
- [NeurIPS 2023] Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans☆21Updated last year
- [NeurIPS'24] The Official PyTorch implementation of DRAIL☆46Updated 9 months ago
- ☆24Updated last year
- ☆25Updated last year
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆44Updated last year
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆34Updated 11 months ago
- Meta-RL Model-Based Algorithm☆40Updated 4 months ago
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value☆34Updated last year
- PWM: Policy Learning with Large World Models☆57Updated last month