Improbable-AI / pql
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
☆63 Updated last year
Alternatives and similar repositories for pql:
Users who are interested in pql are comparing it to the libraries listed below.
- (ICLR 2024) Reverse Forward Curriculum Learning ☆40 Updated last month
- From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data ☆51 Updated last year
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024) ☆45 Updated 4 months ago
- Author's PyTorch implementation of our ICLR 2024 paper "Uni-O4" ☆44 Updated this week
- Finetuning Offline World Models in the Real World ☆51 Updated last year
- Official release of CompoSuite, a compositional RL benchmark ☆47 Updated 11 months ago
- ☆45 Updated last year
- ☆33 Updated 2 weeks ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆67 Updated 6 months ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult… ☆29 Updated 2 years ago
- OGBench: Benchmarking Offline Goal-Conditioned RL ☆99 Updated 2 months ago
- ☆111 Updated 4 years ago
- Code and website for Behavior Transformers: Cloning k modes with one stone. ☆114 Updated last year
- A PyTorch implementation of Implicit Behavioral Cloning ☆97 Updated 2 years ago
- [NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives ☆73 Updated 2 years ago
- Collection of MuJoCo robotics environments equipped with both vision and tactile sensing ☆43 Updated 6 months ago
- ☆46 Updated 3 months ago
- PWM: Policy Learning with Large World Models ☆39 Updated 4 months ago
- Skeleton for scalable and flexible Jax RL implementations ☆67 Updated last year
- Code for Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations ☆65 Updated last year
- Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models ☆22 Updated 2 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆59 Updated last year
- Jax/Flax Implementation of TD-MPC2 ☆51 Updated this week
- Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation (BUDS) ☆45 Updated 3 years ago
- Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research. ☆56 Updated last week
- ☆58 Updated last year
- Wrappers and utilities for Nvidia IsaacGym ☆95 Updated 2 years ago
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022) ☆34 Updated last year
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3 ☆28 Updated last year
- Clean implementation of conditional and unconditional behavior transformer. ☆27 Updated last year