Accompanying code for "Deep Reinforcement Learning that Matters"
☆155 · Sep 22, 2017 · Updated 8 years ago
Alternatives and similar repositories for DeepReinforcementLearningThatMatters
Users that are interested in DeepReinforcementLearningThatMatters are comparing it to the libraries listed below
- Distributed A3C ☆34 · Dec 22, 2017 · Updated 8 years ago
- NIPS 2017 Value Prediction Network ☆167 · Jan 12, 2018 · Updated 8 years ago
- Noisy Networks for Exploration ☆187 · Jan 28, 2018 · Updated 8 years ago
- ROS package for robot learning ☆17 · Oct 16, 2019 · Updated 6 years ago
- ☆160 · Jul 21, 2017 · Updated 8 years ago
- ICML 2018 Self-Imitation Learning ☆277 · Apr 18, 2020 · Updated 5 years ago
- TensorFlow implementation of "The Predictron: End-To-End Learning and Planning" ☆291 · Jan 20, 2017 · Updated 9 years ago
- Publicly releasable baselines for the Retro contest ☆130 · Nov 22, 2018 · Updated 7 years ago
- Value Iteration Networks ☆291 · Apr 21, 2017 · Updated 8 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem… ☆17 · Sep 20, 2017 · Updated 8 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package ☆268 · Oct 24, 2019 · Updated 6 years ago
- Reinforcement Learning with Deep Energy-Based Policies ☆436 · Nov 28, 2023 · Updated 2 years ago
- Implementation of TRPO and related algorithms ☆648 · May 20, 2018 · Updated 7 years ago
- PyTorch implementation of Trust Region Policy Optimization ☆450 · Sep 13, 2018 · Updated 7 years ago
- Collection of reinforcement learners implemented in Python, mainly DQN and its variants ☆54 · Apr 23, 2017 · Updated 8 years ago
- Training Sonic with RLlib ☆62 · Apr 2, 2023 · Updated 2 years ago
- Implementation of A Distributional Perspective on Reinforcement Learning ☆35 · Aug 1, 2017 · Updated 8 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments" ☆310 · Apr 13, 2023 · Updated 2 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym. ☆3,048 · Jun 10, 2023 · Updated 2 years ago
- PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286 ☆184 · Mar 25, 2018 · Updated 7 years ago
- [NIPS 2017] InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations ☆184 · Nov 14, 2024 · Updated last year
- E2C implementation in PyTorch ☆43 · Jul 5, 2017 · Updated 8 years ago
- A parallel version of Trust Region Policy Optimization ☆65 · Mar 6, 2017 · Updated 9 years ago
- ☆15 · Sep 5, 2016 · Updated 9 years ago
- Ranking Policy Gradient ☆23 · Nov 27, 2019 · Updated 6 years ago
- Reinforcement learning toolbox; contains TRPO and A3C algorithms for continuous action spaces ☆42 · Jan 27, 2018 · Updated 8 years ago
- Implementation of Meta-RL A3C algorithm ☆407 · Feb 22, 2017 · Updated 9 years ago
- This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement le… ☆219 · Jul 22, 2019 · Updated 6 years ago
- Implement A3C for Mujoco gym envs ☆73 · Nov 2, 2017 · Updated 8 years ago
- Library for model based RL in robotics ☆37 · Sep 10, 2018 · Updated 7 years ago
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017) ☆1,122 · Oct 13, 2017 · Updated 8 years ago
- A working implementation of the Categorical DQN (Distributional RL). ☆95 · Apr 7, 2018 · Updated 7 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space… ☆24 · Nov 29, 2018 · Updated 7 years ago
- TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper ☆552 · Mar 7, 2019 · Updated 7 years ago
- Efficient Batched Reinforcement Learning in TensorFlow ☆975 · Jan 11, 2019 · Updated 7 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆51 · Jan 16, 2019 · Updated 7 years ago
- Code for hierarchical imitation learning and reinforcement learning ☆301 · Mar 14, 2018 · Updated 8 years ago
- PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom. ☆225 · Mar 29, 2017 · Updated 8 years ago
- Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games" ☆114 · Feb 8, 2016 · Updated 10 years ago