Improbable-AI / pql
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
☆71 Updated last year
Alternatives and similar repositories for pql
Users who are interested in pql are comparing it to the repositories listed below.
- [ICLR 2025] Bootstrapped Model Predictive Control ☆20 Updated 2 months ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult… ☆29 Updated 2 years ago
- (ICLR 2024) Reverse Forward Curriculum Learning ☆47 Updated 7 months ago
- Authors' PyTorch implementation of our ICLR 2024 paper "Uni-O4" ☆51 Updated 5 months ago
- A PyTorch implementation of Implicit Behavioral Cloning ☆106 Updated 3 years ago
- [NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives ☆78 Updated 3 years ago
- Official release of CompoSuite, a compositional RL benchmark ☆49 Updated last year
- Source files to replicate experiments in my ICLR 2022 paper. ☆70 Updated last year
- ☆52 Updated 2 years ago
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022) ☆37 Updated 2 years ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su… ☆29 Updated last year
- ☆120 Updated 5 years ago
- Code for "Planning Goals for Exploration", ICLR 2023 Spotlight. An unsupervised RL agent for hard exploration tasks. ☆78 Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆70 Updated last year
- ☆122 Updated 11 months ago
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2) ☆19 Updated last year
- Skeleton for scalable and flexible Jax RL implementations ☆83 Updated 2 years ago
- A minimal and stable PPO. ☆138 Updated last year
- ☆33 Updated last month
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆86 Updated 7 months ago
- Jax/Flax Implementation of TD-MPC2 ☆65 Updated 2 weeks ago
- Bipedal Skills Benchmark for Reinforcement Learning ☆25 Updated 2 years ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024) ☆52 Updated 9 months ago
- Code and website for Behavior Transformers: Cloning k modes with one stone. ☆126 Updated 2 years ago
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization ☆74 Updated last year
- ☆23 Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆76 Updated last year
- ☆70 Updated 3 years ago
- The official implementation of "Horizon Reduction Makes RL Scalable" ☆116 Updated last month
- Official repository for "STAP: Sequencing Task-Agnostic Policies," presented at ICRA 2023. ☆45 Updated 5 months ago