Improbable-AI / pqlLinks
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
☆76Updated 2 years ago
Alternatives and similar repositories for pql
Users that are interested in pql are comparing it to the libraries listed below
Sorting:
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆30Updated 3 years ago
- [ICLR 2025] Bootstrapped Model Predictive Control☆30Updated 5 months ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆81Updated last year
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆74Updated last year
- Official release of CompoSuite, a compositional RL benchmark☆50Updated last year
- ☆131Updated 5 years ago
- (ICLR 2024) Reverse Forward Curriculum Learning☆51Updated last year
- ☆55Updated 2 years ago
- ☆38Updated this week
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆33Updated 3 months ago
- [NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives☆79Updated 3 years ago
- A PyTorch implementation of Implicit Behavioral Cloning☆110Updated 3 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 6 months ago
- Jax/Flax Implementation of TD-MPC2☆70Updated last week
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆81Updated 2 years ago
- Skeleton for scalable and flexible Jax RL implementations☆94Updated 2 years ago
- A minimal and stable PPO.☆146Updated last year
- ☆75Updated last week
- The official implementation of "Horizon Reduction Makes RL Scalable"☆180Updated 5 months ago
- Skill-based Model-based Reinforcement Learning (CoRL 2022)☆64Updated 3 years ago
- ☆35Updated 7 months ago
- ☆129Updated last year
- From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data☆56Updated 2 years ago
- Code and website for Behavior Transformers: Cloning k modes with one stone.☆136Updated 2 years ago
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆22Updated 5 months ago
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization☆81Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 3 years ago
- Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport☆83Updated 2 years ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆59Updated last year
- ☆72Updated 3 years ago