qgallouedec / deep_rlLinks

Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.

☆21

Alternatives and similar repositories for deep_rl

Users that are interested in deep_rl are comparing it to the libraries listed below

Sorting:

martius-lab / pink-noise-rl
☆42Updated 2 years ago
tseyde / decqn
☆36Updated 2 years ago
adityab / CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
☆75Updated last year
architsharma97 / earl_benchmark
EARL: Environment for Autonomous Reinforcement Learning
☆37Updated 2 years ago
ikostrikov / dmcgym
☆23Updated 2 years ago
RyanNavillus / reward-surfaces
☆17Updated last year
Lifelong-ML / offline-compositional-rl-datasets
☆17Updated last year
TakuyaHiraoka / Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Source files to replicate experiments in my ICLR 2022 paper.
☆70Updated 11 months ago
pairlab / vagram
[ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.
☆24Updated 2 years ago
Egiob / DiversityIsAllYouNeed-SB3
Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.
☆12Updated 2 years ago
sdpkjc / abcdrl
Modular Single-file Reinfocement Learning Algorithms Library
☆37Updated 2 years ago
max7born / decision-lstm
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆27Updated 2 years ago
Improbable-AI / dw-offline-rl
Official implementation of NeurIPS'23 paper, Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
☆26Updated last year
ikostrikov / jaxrl2
☆47Updated 2 years ago
rmst / rlrd
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
☆41Updated 3 years ago
MahanFathi / Model-Based-RL
Model-based Policy Gradients
☆31Updated 5 years ago
steventango / jumpstart-rl
Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3
☆31Updated last year
RedTachyon / coltra-rl
A modular implementation of PPO, and soon hopefully other algorithms.
☆26Updated last year
sahandrez / homomorphic_policy_gradient
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆23Updated last year
frt03 / generalized_dt
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
☆67Updated 2 years ago
omron-sinicx / action-constrained-RL-benchmark
☆24Updated last year
facebookresearch / entity-factored-rl
Source code for the paper "Policy Architectures for Compositional Generalization in Control"
☆30Updated 3 years ago
frankroeder / lanro-gym
OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning
☆14Updated 3 months ago
rraileanu / idaac
☆53Updated last year
hari-sikchi / LOOP
Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]
☆39Updated 2 years ago
seohongpark / HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆85Updated 6 months ago
facebookresearch / hsd3
Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines
☆50Updated 3 years ago
Lifelong-ML / CompoSuite
Official release of CompoSuite, a compositional RL benchmark
☆49Updated last year
penn-pal-lab / peg
Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.
☆78Updated last year
sven1977 / dreamer_v3
Implementation (TensorFlow/keras) of the DreamerV3 model-based RL algorithm by Hafner et al. 2023
☆3Updated last year