whitepaper / RL-Zoo
Implementations of Reinforcement Learning Algorithm
☆39Updated 6 years ago
Related projects: ⓘ
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆68Updated 7 years ago
- ☆25Updated 3 years ago
- Proximal Policy Optimization(PPO) Algorithm and its distributed implementation in Pytorch☆15Updated 6 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago
- reproduce some RL or Multi-Agent models☆35Updated 5 years ago
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>☆89Updated 6 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 5 years ago
- A pack of reinforcement learning algorithms.☆80Updated 2 years ago
- Solutions for CS294-112 Fall2018 assignments in Pytorch☆19Updated 5 years ago
- ☆96Updated 3 years ago
- FEN Code☆36Updated 4 years ago
- 强化学习面试(未完待续)☆32Updated 4 years ago
- Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation☆49Updated 4 years ago
- Assignments for CS294-112 Fall2018 in Pytorch☆63Updated 5 years ago
- ☆97Updated this week
- ☆28Updated last year
- just for fun☆12Updated 6 years ago
- ☆33Updated 6 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆76Updated 5 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆79Updated 6 years ago
- Unified Model-Free Hierarchical Reinforcement Learning Framework☆37Updated 5 years ago
- A toy example of Policy Gradient implemented in Pytorch☆90Updated 6 years ago
- homework for CS294 Fall 2017☆167Updated 6 years ago
- A Multi-agent Learning Framework☆61Updated 3 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆127Updated last year
- Deep Reinforcement Learning with pytorch & visdom (the branch for A3C continuous control)☆24Updated 6 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Updated 6 years ago
- ☆18Updated 5 years ago
- ☆38Updated this week