ml-jku / baselines-rudderView external linksLinks
RUDDER for ATARI games with delayed rewards in OpenAI Baselines package
☆268Oct 24, 2019Updated 6 years ago
Alternatives and similar repositories for baselines-rudder
Users that are interested in baselines-rudder are comparing it to the libraries listed below
Sorting:
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Sep 22, 2017Updated 8 years ago
- Code for the paper "Large-Scale Study of Curiosity-Driven Learning"☆830Aug 12, 2021Updated 4 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆435Nov 28, 2023Updated 2 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,471Dec 7, 2022Updated 3 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆190Mar 18, 2019Updated 6 years ago
- ICML 2018 Self-Imitation Learning☆278Apr 18, 2020Updated 5 years ago
- Code for the paper "Exploration by Random Network Distillation"☆931Oct 1, 2020Updated 5 years ago
- Code for the paper "Meta-Learning Shared Hierarchies"☆618Jul 6, 2023Updated 2 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆206Nov 22, 2018Updated 7 years ago
- Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.☆562Nov 1, 2020Updated 5 years ago
- A reinforcement learning framework☆157Dec 26, 2018Updated 7 years ago
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.☆1,019Mar 13, 2019Updated 6 years ago
- Code for hierarchical imitation learning and reinforcement learning☆301Mar 14, 2018Updated 7 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Implementation of the Option-Critic Architecture on the Atari (ALE) environment☆182Sep 21, 2017Updated 8 years ago
- ☆91Nov 15, 2019Updated 6 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,040Jun 10, 2023Updated 2 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆184Mar 25, 2018Updated 7 years ago
- [ICLR 2018] TensorFlow code for zero-shot visual imitation by self-supervised exploration☆203May 30, 2018Updated 7 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆423Feb 13, 2019Updated 7 years ago
- Gated Path Planning Networks (ICML 2018)☆180Jan 23, 2019Updated 7 years ago
- An implementation of the Augmented Random Search algorithm☆427Sep 29, 2021Updated 4 years ago
- Code for the paper "Evolved Policy Gradients"☆253Nov 22, 2018Updated 7 years ago
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)☆1,123Oct 13, 2017Updated 8 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,876May 29, 2022Updated 3 years ago
- Tensorflow Implementation of Programmable Agents☆35Sep 25, 2017Updated 8 years ago
- Unsupervised instance segmentation via active robot interaction☆76Jul 1, 2022Updated 3 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- Guided Policy Search☆603Feb 9, 2021Updated 5 years ago
- Rainbow: Combining Improvements in Deep Reinforcement Learning☆1,660Jan 13, 2022Updated 4 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.☆661Feb 25, 2020Updated 5 years ago
- Learning Latent Dynamics for Planning from Pixels☆1,234Mar 24, 2023Updated 2 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Jun 18, 2019Updated 6 years ago
- Value Iteration Networks☆291Apr 21, 2017Updated 8 years ago
- Deep Reinforcement Learning with pytorch & visdom☆804Jul 16, 2020Updated 5 years ago
- Publicly releasable baselines for the Retro contest☆129Nov 22, 2018Updated 7 years ago
- PyTorch implementation of Memory Augmented Self-Play☆52Oct 26, 2020Updated 5 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago