MorvanZhou / Meta-Learning
☆35Updated 4 years ago
Related projects:
- Stochastic Variance Reduction Policy Gradient Estimation☆11Updated 5 years ago
- A pack of reinforcement learning algorithms.☆80Updated 2 years ago
- Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation☆49Updated 4 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x☆60Updated 3 years ago
- Assignments for CS294-112 Fall 2018 in PyTorch☆63Updated 5 years ago
- Course notes, David Silver, CS294 ...☆15Updated 5 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆46Updated 3 years ago
- In progress: state-of-the-art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in PyTorch.☆17Updated 6 years ago
- Minimal implementations of reinforcement learning algorithms in TensorFlow☆29Updated 6 years ago
- Deep reinforcement learning agents implemented in TensorFlow https://ghli.org☆54Updated 5 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Updated 4 years ago
- A naive version.☆17Updated 2 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆32Updated 3 years ago
- ☆35Updated 4 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆80Updated 5 years ago
- Chinese-language notes for UCB CS294-112 Deep Reinforcement Learning☆48Updated 3 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Updated 4 years ago
- A translation of Reinforcement Learning: An Introduction☆114Updated 6 years ago
- PyTorch code for the arXiv paper "Learning to learn: Meta-Critic Networks for Sample-Efficient Learning"☆56Updated 6 years ago
- A Multi-agent Learning Framework☆61Updated 3 years ago
- ☆19Updated 4 years ago
- FEN Code☆36Updated 4 years ago
- Chinese edition of the OpenAI team's deep reinforcement learning tutorial☆71Updated last year
- Maximum Causal Entropy Inverse Reinforcement Learning☆43Updated 5 years ago
- ☆10Updated 4 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 5 years ago
- ☆38Updated this week
- Solutions for CS294-112 Fall 2018 assignments in PyTorch☆19Updated 5 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆76Updated 5 years ago
- ☆53Updated 4 years ago