wwxFromTju / MARL-101
just for fun
☆12Updated 6 years ago
Related projects:
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>☆89Updated 6 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆68Updated 7 years ago
- ☆10Updated 7 years ago
- Implementation of PPO-clip and PPO-penalty on Atari, described as the only open-source implementation of PPO-penalty☆56Updated 5 years ago
- Reinforcement learning interview questions (to be continued)☆32Updated 4 years ago
- ☆14Updated 3 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago
- Chinese translation of the book "Reinforcement Learning: An Introduction", Second Edition☆121Updated 5 years ago
- ☆33Updated 6 years ago
- Collection of Deep Reinforcement Learning algorithms☆122Updated 7 years ago
- Implementations of Reinforcement Learning Algorithm☆39Updated 6 years ago
- Implementation of VirtualTaobao☆11Updated 4 years ago
- 1st solution for KDD Cup 2020 (RL track)☆57Updated 4 years ago
- A collection of research and survey papers on reinforcement learning (RL) based recommender system techniques.☆68Updated 4 years ago
- Implementation of Pointer Networks using PyTorch☆61Updated last year
- Reinforcement Learning implementations and research prototyping in TensorFlow☆80Updated 5 years ago
- Learning Resources And Links Of Reinforcement Learning (updating)☆225Updated 3 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆172Updated 6 years ago
- homework for CS294 Fall 2017☆167Updated 6 years ago
- RainBow, Tensorflow☆49Updated 6 years ago
- Chinese translation of Sutton's "Reinforcement Learning: An Introduction"☆29Updated 6 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising☆27Updated 4 years ago
- Paper list in the area of reinforcement learning for recommendation systems☆24Updated 4 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆127Updated last year
- Pytorch implementation of PPO2☆16Updated 5 years ago
- Some multi-agent environments from "Multi-agent Reinforcement Learning in Sequential Social Dilemmas" and "Value-Decomposition Networks For Co…☆129Updated last year
- Ape-X DQN & DDPG with pytorch & tensorboard☆101Updated 5 years ago
- Tensorflow implementation of an Actor Critic algorithm using a Pointer Network to solve the TSP (algorithm from Neural Combinatorial Opti…☆42Updated 6 years ago
- ☆96Updated 3 years ago
- Pointer Networks Implementation in Keras☆152Updated 2 years ago