ikostrikov / rlpd
☆231Updated last year
Related projects
Alternatives and complementary repositories for rlpd
- OGBench: Benchmarking Offline Goal-Conditioned RL☆79Updated 3 weeks ago
- Author's PyTorch implementation of TD7 for online and offline RL☆115Updated last year
- Repo for Implicit Diffusion Q-Learning☆93Updated 11 months ago
- Code and website for Behavior Transformers: Cloning k modes with one stone.☆109Updated last year
- PyTorch implementation of DreamerV2 model-based RL algorithm☆209Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated last year
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆394Updated last week
- Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆77Updated 3 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆95Updated 5 months ago
- ☆92Updated this week
- DMControl Generalization Benchmark☆168Updated 10 months ago
- PyTorch version of Dreamer, which follows the original TF v2 code.☆113Updated 2 years ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆72Updated 7 months ago
- Code for "Temporal Difference Learning for Model Predictive Control"☆360Updated 11 months ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.☆98Updated last year
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar…☆113Updated last year
- ☆69Updated 2 years ago
- ☆201Updated 9 months ago
- PyTorch implementation of Dreamer (Model-based Image RL Algorithm)☆163Updated 2 years ago
- ☆235Updated 2 years ago
- A simple and scalable agent for training adaptive policies with sequence-based RL☆91Updated this week
- ☆332Updated 2 years ago
- Official implementation of "Accelerating Reinforcement Learning with Learned Skill Priors", Pertsch et al., CoRL 2020☆191Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆58Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy☆80Updated last year
- Synthetic Experience Replay☆73Updated 5 months ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆155Updated 2 years ago
- Official implementation of Diffusion Policy Policy Optimization, arxiv 2024☆227Updated this week
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆152Updated 5 months ago
- A minimal and stable PPO.☆122Updated 9 months ago