ShaneFlandermeyer / tdmpc2-jax
Jax/Flax Implementation of TD-MPC2
☆59Updated last week
Alternatives and similar repositories for tdmpc2-jax:
Users that are interested in tdmpc2-jax are comparing it to the libraries listed below
- ☆75Updated 3 weeks ago
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆69Updated last year
- (ICLR 2024) Reverse Forward Curriculum Learning☆44Updated 4 months ago
- A benchmark for offline goal-conditioned RL and offline RL☆143Updated this week
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆68Updated 9 months ago
- Official repo for paper "TD-M(PC)^2: Improving Temporal Difference MPC Through Policy Constraint"☆45Updated last month
- JAX implementation of WSRL and RL baselines | ICLR 2025☆34Updated last week
- ☆23Updated 7 months ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆77Updated 10 months ago
- Goal-Conditioned Reinforcement Learning with JAX☆137Updated this week
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆27Updated 11 months ago
- Skeleton for scalable and flexible Jax RL implementations☆76Updated last year
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆47Updated 6 months ago
- ☆40Updated 3 months ago
- ☆47Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆68Updated last year
- 🔥 Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆35Updated this week
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆18Updated 9 months ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 9 months ago
- Finetuning Offline World Models in the Real World☆57Updated last year
- [ICLR 2025] Bootstrapped Model Predictive Control☆11Updated this week
- A minimal and stable PPO.☆135Updated last year
- ☆79Updated last month
- [RSS 2023] Official code for "Goal Conditioned Imitation Learning using Score-based Diffusion Policies"☆73Updated last year
- [ICLR 2023] Choreographer: a model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able to effi…☆40Updated 9 months ago
- A Minimal Example of Isaac Gym with DQN and PPO.☆104Updated last year
- Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models☆26Updated 5 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆37Updated last year
- Skill-based Model-based Reinforcement Learning (CoRL 2022)☆58Updated 2 years ago
- ☆70Updated 2 years ago