ShaneFlandermeyer / tdmpc2-jax
Jax/Flax Implementation of TD-MPC2
☆52Updated this week
Alternatives and similar repositories for tdmpc2-jax:
Users that are interested in tdmpc2-jax are comparing it to the libraries listed below
- OGBench: Benchmarking Offline Goal-Conditioned RL☆103Updated 3 months ago
- (ICLR 2024) Reverse Forward Curriculum Learning☆39Updated 2 months ago
- ☆48Updated this week
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆65Updated last year
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆78Updated 8 months ago
- Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models☆23Updated 3 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆61Updated 10 months ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆45Updated 4 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆64Updated 7 months ago
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆18Updated 7 months ago
- Skeleton for scalable and flexible Jax RL implementations☆69Updated last year
- A minimal and stable PPO.☆129Updated 11 months ago
- ☆30Updated this week
- JAX implementation of WSRL and RL baselines | ICLR 2025☆19Updated 2 weeks ago
- PWM: Policy Learning with Large World Models☆39Updated 5 months ago
- Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.☆57Updated 3 weeks ago
- ☆34Updated last month
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆44Updated 2 weeks ago
- ☆46Updated 2 years ago
- ☆49Updated 4 months ago
- Goal-Conditioned Reinforcement Learning with JAX☆116Updated last week
- ☆42Updated 6 months ago
- Source files to replicate experiments in my ICLR 2022 paper.☆67Updated 6 months ago
- Collection of MuJoCo robotics environments equipped with both vision and tactile sensing☆44Updated 6 months ago
- ☆68Updated 3 months ago
- Learning Optimal Policies Through Contact in Differentiable Simulation☆92Updated 8 months ago
- ☆35Updated 3 weeks ago
- Evaluation of TD-MPC2.☆22Updated last year
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.☆103Updated last year
- Code and website for Behavior Transformers: Cloning k modes with one stone.☆116Updated last year