mldsta / mlds-2018-hw4
☆18Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for mlds-2018-hw4
- homework for CS294 Fall 2017☆168Updated 6 years ago
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>☆89Updated 6 years ago
- Chinese Translation for Book 《Reinforcement Learning- An Introduction》-Second Edition☆123Updated 5 years ago
- PyTorch Implementation of REINFORCE for both discrete & continuous control☆264Updated 7 years ago
- Assignments for CS294-112 Fall2018 in Pytorch☆63Updated 6 years ago
- A toy example of Policy Gradient implemented in Pytorch☆91Updated 6 years ago
- Some notes and experience about David Silver's Reinforcement Learning Course☆46Updated 5 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago
- Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch☆250Updated 4 years ago
- ☆97Updated 3 years ago
- homework for CS234 2017☆152Updated 6 years ago
- The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020.☆85Updated 3 years ago
- Deep Q-Learning Network in pytorch (not actively maintained)☆386Updated 7 years ago
- https://icml.cc/Conferences/2018/Schedule☆35Updated 6 years ago
- meta-learning research☆159Updated 3 years ago
- ☆29Updated 6 years ago
- Imitation Learning Homework 1☆36Updated 7 years ago
- The source code for "An Actor Critic Algorithm for Structured Prediction"☆167Updated 7 years ago
- python implementation of the TPGR☆39Updated 5 years ago
- DGN Code☆335Updated last year
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆173Updated 6 years ago
- Mind-aware Multi-agent Management Reinforcement Learning☆81Updated 5 years ago
- Reinforcement Learning in Python☆107Updated 4 years ago
- Collection of Deep Reinforcement Learning algorithms☆123Updated 7 years ago
- (Beta Version!) Experiment Code for Paper ``CoT: Cooperative Training for Generative Modeling of Discrete Data''☆73Updated 5 years ago
- ☆132Updated 6 years ago
- pytorch implementation of VAE-Gumble-Softmax☆62Updated 4 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 7 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆368Updated 5 years ago