deep-reinforcement-learning-book / Chapter15-AlphaZero
Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.
☆29Updated 4 years ago
Related projects: ⓘ
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 4 years ago
- ☆38Updated this week
- A Multi-agent Learning Framework☆61Updated 3 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- ☆28Updated last year
- ☆25Updated 3 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago
- ☆39Updated 2 years ago
- Multi-Agent Determinantal Q-Learning☆41Updated last year
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆60Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆101Updated 3 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆127Updated last year
- Distributed Deep Reinforcement Learning☆29Updated 3 years ago
- My internship project in 𝖢𝖠𝖲𝖨𝖠. 🤗☆1Updated 5 years ago
- A pack of reinforcement learning algorithms.☆80Updated 2 years ago
- ☆15Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆47Updated last year
- RLA is a tool for managing your RL experiments automatically☆70Updated last year
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated last year
- FEN Code☆36Updated 4 years ago
- Personal Repo to keep track of RL papers☆31Updated 3 years ago
- Unified Model-Free Hierarchical Reinforcement Learning Framework☆37Updated 5 years ago
- ☆96Updated 3 years ago
- ☆18Updated 5 years ago
- A simple 2D ball collision engine.☆12Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆69Updated last year
- Decision Transformer: A brand new Offline RL Pattern.☆33Updated 2 years ago
- ☆26Updated last year
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆50Updated 10 months ago