heronsystems / adeptRL
Reinforcement learning framework to accelerate research
☆204Updated 3 years ago
Related projects: ⓘ
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆190Updated 5 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆137Updated 5 months ago
- Multi Agent Reinforcement Learning using MalmÖ☆245Updated 4 years ago
- ICML 2018 Self-Imitation Learning☆274Updated 4 years ago
- A PyTorch implementation of Rainbow DQN agent☆164Updated 6 years ago
- Actor-critic with experience replay☆251Updated last year
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆201Updated 4 years ago
- NIPS 2017 Value Prediction Network☆165Updated 6 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆195Updated 5 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆299Updated last year
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)☆133Updated 5 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆258Updated 5 months ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆195Updated 3 years ago
- Code for the paper "Quantifying Transfer in Reinforcement Learning"☆389Updated 11 months ago
- Multitask Environments for RL☆273Updated 3 years ago
- ☆135Updated 3 years ago
- Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.☆555Updated 3 years ago
- This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement le…☆213Updated 5 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆359Updated 2 years ago
- Velocity in deep-learning research☆276Updated last year
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 5 years ago
- Code for the paper "Evolved Policy Gradients"☆251Updated 5 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆197Updated 3 years ago
- Code for the paper "Phasic Policy Gradient"☆245Updated last year
- Publicly releasable baselines for the Retro contest☆128Updated 5 years ago
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…☆170Updated 5 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆180Updated 6 years ago
- A PyTorch Platform for Distributed RL☆737Updated 3 years ago
- Random Network Distillation pytorch☆239Updated 5 years ago
- PyTorch Implementation of Distributed Prioritized Experience Replay(Ape-X)☆152Updated 5 years ago