MorvanZhou / Meta-Learning
☆35Updated 5 years ago
Related projects
Alternatives and complementary repositories for Meta-Learning
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆46Updated 3 years ago
- Assignments for CS294-112 Fall2018 in Pytorch☆63Updated 6 years ago
- In progress: state-of-the-art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in PyTorch.☆18Updated 6 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Updated 5 years ago
- Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation☆49Updated 4 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆33Updated 4 years ago
- Course notes, David Silver, CS294 ...☆15Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x☆62Updated 3 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆77Updated 5 years ago
- ☆19Updated 4 years ago
- A pack of reinforcement learning algorithms.☆81Updated 3 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆80Updated 5 years ago
- Solutions for CS294-112 Fall2018 assignments in Pytorch☆19Updated 6 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆33Updated 3 years ago
- FEN Code☆37Updated 5 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising☆27Updated 4 years ago
- ☆97Updated 3 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- PyTorch code for the arXiv paper "Learning to Learn: Meta-Critic Networks for Sample-Efficient Learning"☆56Updated 6 years ago
- RainBow, Tensorflow☆49Updated 6 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 5 years ago
- Random Network Distillation (RND) algorithm in PyTorch☆48Updated 5 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago
- Collaborative Deep Reinforcement Learning☆32Updated 7 years ago
- Deep reinforcement learning agents implemented in TensorFlow https://ghli.org☆54Updated 5 years ago
- Codes for Stackelberg GAN☆12Updated 5 years ago
- ☆53Updated 4 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Updated 5 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago