haje01 / distperLinks
Distributed Priortized Experience Replay
☆10Updated 6 years ago
Alternatives and similar repositories for distper
Users that are interested in distper are comparing it to the libraries listed below
Sorting:
- Deep Reinforcement Learning Algorithms Implementation in PyTorch☆27Updated 3 months ago
- Deep Multi-Agent Reinforcement Learning with StarCraft 2☆10Updated 4 years ago
- Repository for our ICML 2019 paper: Curiosity-Bottleneck☆34Updated 2 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch☆42Updated 6 years ago
- Repository for studying distributional rl☆30Updated 4 months ago
- Yet Another Reinforcement Learning Tutorial☆73Updated 2 years ago
- RLOpensource / IMPALA-Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures☆37Updated 5 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Updated 6 years ago
- The state-of-art deep rl algorithms for Montezuma's revenge☆27Updated 6 years ago
- ☆49Updated 6 years ago
- Generalised UDRL☆37Updated 3 years ago
- implementation of distributed reinforcement learning with distributed tensorflow☆56Updated 3 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Updated 4 years ago
- ☆10Updated 6 years ago
- PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)☆16Updated 4 years ago
- ☆18Updated last year
- Official code for the paper "Learning Transition Policies for Composing Complex Skills" (ICLR 2019)☆73Updated 6 years ago
- Brain Agent for Large-Scale and Multi-Task Agent Learning☆93Updated last year
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆15Updated 2 years ago
- Simple implementations of multi-agent evolutionary strategies using pytorch.☆16Updated 3 years ago
- Implementation of Neural Episodic Control in Tensorflow☆27Updated 6 years ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 5 years ago
- Scalable distributed reinforcement learning agents on kubernetes☆57Updated last year
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Updated 5 years ago
- AGAC: Adversarially Guided Actor-Critic☆49Updated 3 years ago
- ☆40Updated 5 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 6 years ago
- ☆30Updated 3 years ago
- RAD: Reinforcement Learning with Augmented Data (code for state augmentation)☆11Updated 4 years ago