ShaneFlandermeyer / tdmpc2-jax
Jax/Flax Implementation of TD-MPC2
☆48Updated this week
Related projects ⓘ
Alternatives and complementary repositories for tdmpc2-jax
- OGBench: Benchmarking Offline Goal-Conditioned RL☆81Updated 3 weeks ago
- (ICLR 2024) Reverse Forward Curriculum Learning☆38Updated 2 months ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆43Updated 2 months ago
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆62Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆57Updated 5 months ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆73Updated 6 months ago
- ☆42Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆62Updated 4 months ago
- ☆40Updated 2 months ago
- [ICLR 2024] Official implementation for "Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations"☆43Updated 6 months ago
- PWM: Policy Learning with Large World Models☆37Updated 3 months ago
- Code of the paper "LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning" & LocoMuJoCo Baselines☆42Updated 9 months ago
- Skill-based Model-based Reinforcement Learning (CoRL 2022)☆52Updated 2 years ago
- Coarse-to-fine Q-Network☆33Updated 3 months ago
- Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models☆17Updated last month
- A minimal and stable PPO.☆124Updated 9 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆60Updated 8 months ago
- ☆27Updated 3 months ago
- [RSS 2023] Official code for "Goal Conditioned Imitation Learning using Score-based Diffusion Policies"☆62Updated 11 months ago
- Demo-Driven Mobile Bi-Manual Manipulation Benchmark.☆111Updated last month
- Skeleton for scalable and flexible Jax RL implementations☆63Updated last year
- A Minimal Example of Isaac Gym with DQN and PPO.☆94Updated last year
- Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.☆57Updated last year
- Wrappers and utilities for Nvidia IsaacGym☆93Updated 2 years ago
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆16Updated 5 months ago
- ☆22Updated 3 years ago
- Code and website for Behavior Transformers: Cloning k modes with one stone.☆109Updated last year
- From Imitation to Refinement -- Residual RL for Precise Visual Assembly☆55Updated last week
- ☆52Updated last year