haron1100 / Upside-Down-Reinforcement-Learning
Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch
☆27Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Upside-Down-Reinforcement-Learning
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆66Updated 4 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆76Updated 4 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆52Updated 5 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆25Updated 3 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆23Updated 5 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆105Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆99Updated 2 years ago
- Implementation of "Training Agents using Upside-Down Reinforcement Learning (https://arxiv.org/pdf/1912.02877.pdf)"☆16Updated 4 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 5 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆39Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆42Updated last year
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- ☆21Updated 2 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆99Updated 4 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20Updated 2 years ago
- MultiTask Environments for Reinforcement Learning.☆74Updated 2 years ago
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/☆90Updated 3 years ago
- General purpose environment wrappers for openai gym☆24Updated 5 years ago
- Hierarchical Self-Play☆21Updated 5 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆44Updated 6 years ago
- Episodic Control☆19Updated 2 years ago
- on-policy optimization baselines for deep reinforcement learning☆28Updated 4 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆35Updated 5 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆29Updated 4 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago