Actor-critic with experience replay
☆257Oct 9, 2022Updated 3 years ago
Alternatives and similar repositories for ACER
Users that are interested in ACER are comparing it to the libraries listed below
Sorting:
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆184Mar 25, 2018Updated 7 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆80Jan 5, 2019Updated 7 years ago
- PyTorch implementation of Trust Region Policy Optimization☆450Sep 13, 2018Updated 7 years ago
- Deep Reinforcement Learning with pytorch & visdom☆804Jul 16, 2020Updated 5 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,316Sep 25, 2019Updated 6 years ago
- Implementation for ACER in tensorflow and sonnet by deepmind☆11Aug 28, 2017Updated 8 years ago
- A3C LSTM Atari with Pytorch plus A3G design☆570Apr 18, 2023Updated 2 years ago
- ☆58Aug 28, 2018Updated 7 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆313Feb 16, 2021Updated 5 years ago
- Implementation of TRPO and related algorithms☆647May 20, 2018Updated 7 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,875May 29, 2022Updated 3 years ago
- ☆38Mar 6, 2017Updated 8 years ago
- PyTorch implementation of both discrete and continuous ACER☆24Jan 27, 2019Updated 7 years ago
- Value Iteration Networks☆291Apr 21, 2017Updated 8 years ago
- Implementations of deep RL papers and random experimentation☆178Apr 7, 2018Updated 7 years ago
- Malmo Collaborative AI Challenge - Team Pig Catcher☆66May 22, 2017Updated 8 years ago
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆436Nov 28, 2023Updated 2 years ago
- PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.☆225Mar 29, 2017Updated 8 years ago
- implement of prioritized experience replay☆159Aug 20, 2018Updated 7 years ago
- Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch☆629Aug 13, 2018Updated 7 years ago
- Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom☆278Feb 20, 2018Updated 8 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆408Feb 25, 2017Updated 9 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- Reinforcement learning models in ViZDoom environment☆130Mar 9, 2022Updated 3 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆348Nov 22, 2018Updated 7 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment☆263Feb 8, 2018Updated 8 years ago
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)☆80Mar 13, 2017Updated 8 years ago
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161☆186Nov 1, 2017Updated 8 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,045Jun 10, 2023Updated 2 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning☆201Jun 3, 2017Updated 8 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆423Feb 13, 2019Updated 7 years ago