mavischer / DRRL
A2C training of Relational Deep Reinforcement Learning Architecture
☆13Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for DRRL
- ☆53Updated 8 months ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆17Updated 10 months ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆15Updated 4 years ago
- Learning Laplacian Representations in Reinforcement Learning☆17Updated 3 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆43Updated last year
- ☆51Updated last year
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆31Updated 4 years ago
- V-MPO torch version with DMLab30 and GTrXL☆12Updated 3 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- ☆28Updated 2 years ago
- ☆18Updated last year
- ☆24Updated last year
- ☆40Updated 3 years ago
- ☆18Updated last year
- Mirror Descent Policy Optimization☆37Updated 4 years ago
- ☆13Updated 3 years ago
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- on-policy optimization baselines for deep reinforcement learning☆28Updated 4 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆17Updated last year
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆50Updated 3 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- ☆28Updated last year
- Implementation of the Option-Critic Architecture☆36Updated 5 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆53Updated 5 years ago
- ☆47Updated last year
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 5 years ago
- ☆30Updated 3 months ago