buaazhangfan / CS294-112-Deep-Reinforcement-Learning
Assignments for CS294-112 Deep Reinforcement Learning in UC Berkeley in Fall 2018
☆16Updated 6 years ago
Alternatives and similar repositories for CS294-112-Deep-Reinforcement-Learning:
Users that are interested in CS294-112-Deep-Reinforcement-Learning are comparing it to the libraries listed below
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- Assignments for CS294-112 Fall2018 in Pytorch☆63Updated 6 years ago
- Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆48Updated 5 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- Guided-Meta Policy Search☆41Updated 2 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆55Updated 5 years ago
- Code accompanying the OptionGAN paper.☆44Updated 6 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- My solution to assignments in UC Berkeley CS294-112: Deep Reinforcement Learning☆90Updated 5 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 5 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning☆96Updated 6 years ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆61Updated 6 years ago
- Implementation of Behavioral Cloning from Observationmentation☆16Updated 5 years ago
- ☆41Updated 6 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework☆36Updated 6 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆103Updated 5 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆71Updated last year
- Mind-aware Multi-agent Management Reinforcement Learning☆81Updated 5 years ago
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]☆25Updated 3 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆65Updated 5 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Updated 6 years ago
- Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)☆75Updated 5 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 5 years ago
- Multi-agent algorithm based on counterfactual multi-agent policy gradients☆7Updated 6 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆34Updated 5 years ago
- Source code for our NIPS 2017 paper, InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations☆42Updated 7 years ago