Improbable-AI / pql

Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation

☆72

Alternatives and similar repositories for pql

Users who are interested in pql are comparing it to the libraries listed below.

wertyuilife2 / bmpc
[ICLR 2025] Bootstrapped Model Predictive Control
☆20 Updated 2 weeks ago
Asap7772 / PTR
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…
☆29 Updated 2 years ago
kevinzakka / ibc
A PyTorch implementation of Implicit Behavioral Cloning
☆108 Updated 3 years ago
Lei-Kun / Uni-O4
Author's PyTorch implementation of our ICLR 2024 paper "Uni-O4"
☆51 Updated 6 months ago
ShaneFlandermeyer / tdmpc2-jax
Jax/Flax Implementation of TD-MPC2
☆65 Updated last month
StoneT2000 / rfcl
(ICLR 2024) Reverse Forward Curriculum Learning
☆48 Updated 8 months ago
TakuyaHiraoka / Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Source files to replicate experiments in my ICLR 2022 paper.
☆70 Updated 2 weeks ago
ToruOwO / minimal-stable-PPO
A minimal and stable PPO.
☆140 Updated last year
penn-pal-lab / scaffolder
Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…
☆29 Updated last year
Lifelong-ML / CompoSuite
Official release of CompoSuite, a compositional RL benchmark
☆49 Updated last year
tdmpc2 / tdmpc2-eval
Evaluation of TD-MPC2.
☆22 Updated last year
mihdalal / raps
[NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives
☆78 Updated 3 years ago
notmahi / bet
Code and website for Behavior Transformers: Cloning k modes with one stone.
☆128 Updated 2 years ago
seohongpark / METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆71 Updated last year
google-research / relay-policy-learning
☆122 Updated 5 years ago
yunhaif / fowm
Finetuning Offline World Models in the Real World
☆59 Updated last year
aalmuzairee / dmcgb2
Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)
☆21 Updated 2 weeks ago
Viraj-Joshi / MTBench
☆15 Updated last week
youngwoon / robot-learning
☆53 Updated 2 years ago
penn-pal-lab / peg
Code for "Planning Goals for Exploration", ICLR 2023 Spotlight. An unsupervised RL agent for hard exploration tasks.
☆78 Updated last year
realquantumcookie / APRL
Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization
☆76 Updated last year
seohongpark / HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆86 Updated 8 months ago
jianlanluo / SAQ
☆33 Updated last month
iamlab-cmu / isaacgym-utils
Wrappers and utilities for Nvidia IsaacGym
☆100 Updated 3 years ago
dibyaghosh / jaxrl_m
Skeleton for scalable and flexible Jax RL implementations
☆84 Updated 2 years ago
google-deepmind / rgb_stacking
☆125 Updated last year
chauncygu / Safe-Multi-Agent-Isaac-Gym
Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.
☆61 Updated 6 months ago
siddhanthaldar / ROT
Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport
☆80 Updated 2 years ago
clvrai / skimo
Skill-based Model-based Reinforcement Learning (CoRL 2022)
☆60 Updated 2 years ago
facebookresearch / bipedal-skills
Bipedal Skills Benchmark for Reinforcement Learning
☆25 Updated 2 years ago