Improbable-AI / pql
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
☆74 Updated 2 years ago
Alternatives and similar repositories for pql
Users who are interested in pql are comparing it to the libraries listed below.
- [ICLR 2025] Bootstrapped Model Predictive Control ☆21 Updated last month
- Jax/Flax Implementation of TD-MPC2 ☆64 Updated 3 weeks ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult… ☆28 Updated 2 years ago
- (ICLR 2024) Reverse Forward Curriculum Learning ☆48 Updated 9 months ago
- ☆29 Updated last week
- ☆53 Updated 2 years ago
- ☆126 Updated 5 years ago
- [NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives ☆78 Updated 3 years ago
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4" ☆53 Updated 7 months ago
- A PyTorch implementation of Implicit Behavioral Cloning ☆109 Updated 3 years ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆70 Updated last month
- A minimal and stable PPO. ☆142 Updated last year
- Official release of CompoSuite, a compositional RL benchmark ☆50 Updated last year
- Skill-based Model-based Reinforcement Learning (CoRL 2022) ☆59 Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆73 Updated last year
- Code and website for Behavior Transformers: Cloning k modes with one stone. ☆131 Updated 2 years ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su… ☆29 Updated last year
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks. ☆77 Updated last year
- ☆127 Updated last year
- ☆33 Updated 3 months ago
- Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research. ☆61 Updated 8 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆88 Updated 9 months ago
- Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport ☆82 Updated 2 years ago
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization ☆78 Updated last year
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2) ☆21 Updated last month
- Official repository for "STAP: Sequencing Task-Agnostic Policies," presented at ICRA 2023. ☆48 Updated 7 months ago
- ☆85 Updated 3 years ago
- Wrappers and utilities for Nvidia IsaacGym ☆99 Updated 3 years ago
- [ICLR 2023] Choreographer: a model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able to effi… ☆41 Updated last year
- ☆70 Updated 3 years ago