allenjywang / Reinforcement_Learning_NotesLinks
A naive version.
☆19Updated 3 years ago
Alternatives and similar repositories for Reinforcement_Learning_Notes
Users that are interested in Reinforcement_Learning_Notes are comparing it to the libraries listed below
Sorting:
- A pack of reinforcement learning algorithms.☆84Updated 3 years ago
- ☆43Updated last month
- Maximum Causal Entropy Inverse Reinforcement Learning☆47Updated 6 years ago
- ☆25Updated 3 years ago
- DQN examples codes in chapter 4☆43Updated 2 years ago
- 天授中文文档☆58Updated 6 months ago
- ☆17Updated 3 years ago
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆103Updated 3 years ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆56Updated 3 years ago
- ☆165Updated last year
- Hierarchical-DQN in pytorch (not actively maintained)☆69Updated 8 years ago
- ☆124Updated 3 years ago
- ☆36Updated 5 years ago
- RL-code for beginners. Enjoying!☆115Updated 5 years ago
- ☆10Updated 3 years ago
- 此项目中将上传我在B站《强化学习理论基础》系列视频中的板书、参考资料等内容。☆76Updated 2 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆88Updated 4 years ago
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆52Updated 4 years ago
- Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆41Updated 3 years ago
- A Multi-agent Learning Framework☆62Updated 4 years ago
- A clean and robust Pytorch implementation of PPO on Discrete action space☆70Updated last year
- Solutions for CS294-112 Fall2018 assignments in Pytorch☆20Updated 6 years ago
- ☆42Updated 2 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆73Updated 2 years ago
- ☆99Updated 4 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆24Updated 5 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆131Updated last year
- Some notes and experience about David Silver's Reinforcement Learning Course☆46Updated 6 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- ☆38Updated 2 years ago