bluemoonwencong / note-on-Deep-Reinforcement-Learning
Course notes, David Silver, CS294 ...
☆15 Updated 6 years ago
Alternatives and similar repositories for note-on-Deep-Reinforcement-Learning:
Users who are interested in note-on-Deep-Reinforcement-Learning are comparing it to the repositories listed below
- Random Network Distillation(RND) algo in Pytorch ☆49 Updated 6 years ago
- Policy gradient reinforcement learning algorithm with importance sampling ☆31 Updated 7 years ago
- A repository for code of reinforcement learning algorithms with PyTorch ☆30 Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆55 Updated 5 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained) ☆25 Updated 4 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x ☆62 Updated 3 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874 ☆46 Updated 4 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆27 Updated 6 years ago
- State Space Models for Reinforcement Learning in Tensorflow ☆19 Updated 6 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning ☆47 Updated 6 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables ☆71 Updated 2 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments ☆45 Updated 4 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875) ☆34 Updated 4 years ago
- FEN Code ☆37 Updated 5 years ago
- Implementation of Random Expert Distillation ☆29 Updated 5 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation ☆31 Updated 5 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch. ☆19 Updated 6 years ago
- ☆97 Updated 4 years ago
- Autonomous Navigation using Deep Reinforcement Learning ☆24 Updated 7 years ago
- Safe Reinforcement Learning algorithms ☆75 Updated 2 years ago
- Assignments for CS294-112 Fall2018 in Pytorch ☆63 Updated 6 years ago
- Multi-agent algorithm based on counterfactual multi-agent policy gradients ☆7 Updated 6 years ago
- Code implementation of: "Graying the black box: Understanding DQNs" ☆20 Updated 8 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture ☆20 Updated 6 years ago
- My solutions to CS285 2019 Fall of UC Berkeley ☆14 Updated 2 years ago
- A Multi-agent Learning Framework ☆62 Updated 3 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning ☆14 Updated 5 years ago
- A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env ☆71 Updated 7 years ago
- References at the Intersection of Causality and Reinforcement Learning ☆89 Updated 4 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆47 Updated 5 years ago