chanb / rl_sandbox_publicLinks

PyTorch implementation of (Deep) Reinforcement Learning (RL) algorithms

☆24

Alternatives and similar repositories for rl_sandbox_public

Users that are interested in rl_sandbox_public are comparing it to the libraries listed below

Sorting:

kngwyu / mujoco-maze
Simple maze environments using mujoco-py
☆57Updated last year
adityab / CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
☆78Updated last year
TakuyaHiraoka / Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Source files to replicate experiments in my ICLR 2022 paper.
☆70Updated 3 weeks ago
hari-sikchi / AWAC
Advantage weighted Actor Critic for Offline RL
☆50Updated 2 years ago
liuzuxin / safe-mbrl
Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method
☆66Updated 2 years ago
Wenxuan-Zhou / PLAS
Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]
☆53Updated 3 years ago
jakegrigsby / super_sac
A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…
☆38Updated last year
ikostrikov / dmcgym
☆23Updated 2 years ago
dibyaghosh / jaxrl_m
Skeleton for scalable and flexible Jax RL implementations
☆84Updated 2 years ago
elliotchanesane31 / RIS
☆54Updated 4 years ago
cross32768 / PlaNet_PyTorch
Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551 ) with PyTorch
☆47Updated 5 years ago
martius-lab / pink-noise-rl
☆42Updated 2 years ago
twni2016 / f-IRL
Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020
☆45Updated 2 years ago
ikostrikov / jaxrl2
☆47Updated 2 years ago
QData / dmc_remastered
A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.
☆18Updated 4 years ago
denisyarats / exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
☆115Updated 3 years ago
tedmoskovitz / TOP
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25Updated 2 years ago
natolambert / continuousprediction
Formulating Model-based RL Dynamics as a continuous rather then one step prediction
☆35Updated 2 years ago
SvenGronauer / Bullet-Safety-Gym
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
☆70Updated 2 years ago
facebookresearch / hsd3
Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines
☆50Updated 3 years ago
hari-sikchi / LOOP
Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]
☆40Updated 2 years ago
OffDynamicsRL / off-dynamics-rl
☆49Updated 3 weeks ago
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆171Updated 8 months ago
evgenii-nikishin / rl_with_resets
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
☆100Updated 3 years ago
nakamotoo / Cal-QL
official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
☆104Updated last year
montrealrobotics / domain-randomizer
A standalone library to randomize various OpenAI Gym Environments
☆63Updated 5 years ago
clvrai / skimo
Skill-based Model-based Reinforcement Learning (CoRL 2022)
☆60Updated 2 years ago
snasiriany / leap
Official codebase for LEAP: Planning with Goal Conditioned Policies
☆49Updated 2 years ago
intelligent-control-lab / guard
☆53Updated 6 months ago
KarlXing / RL-Visual-Continuous-Control
RL Algorithms for Visual Continuous Control
☆32Updated 2 years ago