wertyuilife2 / bmpc
[ICLR 2025] Bootstrapped Model Predictive Control
☆26 Updated 5 months ago
Alternatives and similar repositories for bmpc
Users that are interested in bmpc are comparing it to the libraries listed below
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation ☆76 Updated 2 years ago
- ☆36 Updated 3 months ago
- (ICLR 2024) Reverse Forward Curriculum Learning ☆51 Updated last year
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su… ☆32 Updated 3 months ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024) ☆58 Updated last year
- Jax/Flax Implementation of TD-MPC2 ☆69 Updated last week
- Official repo for paper "TD-M(PC)^2: Improving Temporal Difference MPC Through Policy Constraint" ☆74 Updated 10 months ago
- PWM: Policy Learning with Large World Models ☆64 Updated 4 months ago
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4" ☆73 Updated 11 months ago
- ☆67 Updated 2 weeks ago
- The official implementation of "Horizon Reduction Makes RL Scalable" ☆180 Updated 4 months ago
- ☆28 Updated last year
- [ICML2025] Official implementation of Efficient Online Reinforcement Learning for Diffusion Policies appearing in ICML 2025. ☆40 Updated this week
- JAX implementation of WSRL and RL baselines | ICLR 2025 ☆124 Updated 5 months ago
- [ICLR 2024] Official implementation for "Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations" ☆88 Updated 10 months ago
- ☆68 Updated last year
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks. ☆81 Updated last year
- [ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning. ☆77 Updated 4 months ago
- Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research. ☆64 Updated 11 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆81 Updated 2 years ago
- Learning Optimal Policies Through Contact in Differentiable Simulation ☆107 Updated last year
- ☆47 Updated 3 months ago
- Evaluation of TD-MPC2. ☆21 Updated last year
- The official implementation of Value Flows ☆35 Updated 2 months ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning" ☆82 Updated last month
- Official repository for "STAP: Sequencing Task-Agnostic Policies," presented at ICRA 2023. ☆51 Updated 10 months ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆27 Updated 2 years ago
- Finetuning Offline World Models in the Real World ☆63 Updated 2 years ago
- Official implementation of DEMO3 ☆66 Updated 5 months ago
- Skeleton for scalable and flexible Jax RL implementations ☆92 Updated 2 years ago