camellyx / 10707-deep-learning-project
Deep Learning Course Project
☆12Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for 10707-deep-learning-project
- A collection of multi-agent reinforcement learning OpenAI gym environments☆44Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆36Updated 5 years ago
- OpenAI Gym environment for Robot Soccer Goal☆17Updated 5 years ago
- ☆25Updated 6 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆13Updated 7 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 5 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆61Updated 6 years ago
- Fictitious Self-play & Reinforcement Learning☆19Updated 6 years ago
- ☆71Updated 5 months ago
- A curated list of awesome Model-based reinforcement learning resources☆90Updated 4 years ago
- Notes and comments about Deep Reinforcement Learning papers☆76Updated 6 years ago
- A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env☆71Updated 7 years ago
- ☆53Updated 6 years ago
- Code for training policies based on paper Coordinated Multi-Agent Imitation Learning☆26Updated 7 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆48Updated 5 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆53Updated last year
- A Tensorflow implementation of the Option-Critic Architecture☆70Updated 7 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆131Updated last year
- An environment for an obstacle avoidance task☆34Updated 3 years ago
- Hierarchical Deep RL Network☆30Updated 7 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 3 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆26Updated 5 years ago
- soft q learning and soft actor critic☆15Updated 5 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆46Updated 3 years ago