buaazhangfan / CS294-112-Deep-Reinforcement-Learning
Assignments for CS294-112 Deep Reinforcement Learning in UC Berkeley in Fall 2018
☆16Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for CS294-112-Deep-Reinforcement-Learning
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆55Updated 5 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework☆36Updated 6 years ago
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 5 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆65Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated last year
- Reinforcement Learning implementations and research prototyping in TensorFlow☆80Updated 5 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆101Updated 5 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Updated 6 years ago
- Assignments for CS294-112 Fall2018 in Pytorch☆63Updated 6 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆69Updated last year
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54Updated 5 years ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆60Updated 5 years ago
- Autoregressive policies for continuous control reinforcement learning☆28Updated 5 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- Collaborative Deep Reinforcement Learning☆32Updated 7 years ago
- Code for CORL'18 paper "Risk-Aware Active Inverse Reinforcement Learning"☆15Updated 5 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 3 years ago
- PyTorch implementation of Memory Augmented Self-Play☆50Updated 4 years ago
- Implementation of Random Expert Distillation☆29Updated 5 years ago
- Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)☆33Updated 5 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 2 months ago
- Multi-agent algorithm based on counterfactual multi-agent policy gradients☆7Updated 5 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Updated 6 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Updated last year
- Ranking Policy Gradient☆23Updated 4 years ago