wchliao / RL_overview_note
Note: "Deep Reinforcement Learning: An Overview"
☆14 Updated 6 years ago
Related projects:
- PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting. ☆38 Updated 6 years ago
- Deep Q Network implemented in TensorFlow ☆25 Updated 6 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation ☆53 Updated 4 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch ☆41 Updated 5 years ago
- Actor Critic using Kronecker-Factored Trust Region ☆19 Updated 6 years ago
- ZForcing Repo ☆40 Updated 6 years ago
- ☆13 Updated this week
- Implementation of Counterfactual risk minimization ☆26 Updated 7 years ago
- 📉 A collection of TensorBoard-related utilities (In Progress) ☆37 Updated last year
- Comparison of bandit algorithms from the Reinforcement Learning bible. ☆17 Updated 6 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning ☆17 Updated 7 years ago
- Optimizers in tensorflow from scratch ☆17 Updated 7 years ago
- Combining deep learning and reinforcement learning. ☆81 Updated 2 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. … ☆33 Updated 8 years ago
- ☆42 Updated 5 years ago
- Tensorflow implementation of A3C algorithm ☆48 Updated 7 years ago
- Deep Reinforcement Learning with Fine Grained Action Repetition ☆23 Updated 6 years ago
- Codebase of Santara et al., RAIL: Risk Averse Imitation Learning, published in AAMAS 2018 ☆14 Updated 2 years ago
- An optimized version of SeqGAN in pytorch ☆12 Updated 6 years ago
- Code for the paper "SelectiveNet: A Deep Neural Network with an Integrated Reject Option" ☆12 Updated 5 years ago
- ☆42 Updated 6 years ago
- State Space Models for Reinforcement Learning in Tensorflow ☆17 Updated 5 years ago
- Policy gradient reinforcement learning algorithm with importance sampling ☆31 Updated 6 years ago
- ☆46 Updated 6 years ago
- On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignment) ☆11 Updated 6 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future ☆51 Updated 5 years ago
- Notes for Deep Learning Papers ☆19 Updated 5 years ago
- RainBow, Tensorflow ☆49 Updated 6 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 4 years ago
- ☆7 Updated 7 years ago