aalmuzairee / dmcgb2Links
Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)
☆22Updated 6 months ago
Alternatives and similar repositories for dmcgb2
Users that are interested in dmcgb2 are comparing it to the libraries listed below
Sorting:
- Evaluation of TD-MPC2.☆21Updated 2 years ago
- ☆39Updated 3 weeks ago
- The official implementation of Value Flows☆39Updated 3 months ago
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆76Updated 2 years ago
- ☆81Updated 3 weeks ago
- (ICLR 2024) Reverse Forward Curriculum Learning☆52Updated last year
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆30Updated 3 years ago
- [ICLR 2025] Bootstrapped Model Predictive Control☆31Updated 6 months ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆33Updated 4 months ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆82Updated last year
- PWM: Policy Learning with Large World Models☆65Updated 6 months ago
- ☆55Updated 2 years ago
- Jax/Flax Implementation of TD-MPC2☆70Updated 3 weeks ago
- From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data☆56Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆71Updated 6 months ago
- ☆31Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆92Updated last year
- Official release of CompoSuite, a compositional RL benchmark☆50Updated 2 years ago
- Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL☆30Updated 3 months ago
- Q-learning with Adjoint Matching☆26Updated last week
- ☆23Updated 3 years ago
- Code for the Behavior Retrieval Paper☆36Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆84Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆27Updated 2 years ago
- KitchenShift: Evaluating Zero-Shot Generalization of Imitation-Based Policy Learning Under Domain Shifts☆19Updated 3 years ago
- ☆60Updated 2 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆113Updated last year
- Code and website for Behavior Transformers: Cloning k modes with one stone.☆136Updated 2 years ago
- Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation (BUDS)☆57Updated 4 years ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆59Updated last year