KuNyaa / berkeleydeeprlcourse-homework-pytorch
Assignments for CS294-112 Fall2018 in Pytorch
☆63Updated 5 years ago
Related projects: ⓘ
- Solutions for CS294-112 Fall2018 assignments in Pytorch☆19Updated 5 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆101Updated 5 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago
- Implementation of the Deep Deterministic Policy Gradient(DDPG) in bullet Gym using pytorch☆41Updated 6 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆64Updated 5 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆80Updated 5 years ago
- FEN Code☆36Updated 4 years ago
- A toy example of Policy Gradient implemented in Pytorch☆90Updated 6 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆48Updated 5 years ago
- My solution to assignments in UC Berkeley CS294-112: Deep Reinforcement Learning☆90Updated 5 years ago
- ☆96Updated 3 years ago
- homework for CS294 Fall 2017☆167Updated 6 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 7 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 4 years ago
- ☆28Updated last year
- ☆106Updated 4 years ago
- Random Network Distillation(RND) algo in Pytorch☆48Updated 5 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Updated 6 years ago
- homework for CS234 2017☆152Updated 6 years ago
- Proximal Policy Optimization(PPO) Algorithm and its distributed implementation in Pytorch☆15Updated 6 years ago
- ☆90Updated 9 months ago
- Adversarial Imitation Via Variational Inverse Reinforcement Learning☆93Updated 4 years ago
- ☆59Updated 6 years ago
- Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆48Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆92Updated 2 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆127Updated last year
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆76Updated 5 years ago
- Trust Region Policy Optimization (TRPO) in pure TensorFlow☆18Updated 6 years ago
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆56Updated 6 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago