WorldEditor50 / snakeAI
testing MLP, DQN, PPO, SAC, policy-gradient by snakeAI
☆10Updated last week
Related projects: ⓘ
- [IJCAI 2022] "Dynamic Sparse Training for Deep Reinforcement Learning" by Ghada Sokar, Elena Mocanu , Decebal Constantin Mocanu, Mykola P…☆11Updated 2 years ago
- Distributed Deep Reinforcement Learning☆29Updated 3 years ago
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆50Updated 10 months ago
- Unofficial code for online decision transformer☆37Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆47Updated last year
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13Updated 5 years ago
- StarCraft II Reinforcement Learning with Pytorch - Mini Games☆23Updated 6 years ago
- Blazingly Fast Implementation of Deep Q-Network in C++ with NNabla☆14Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆16Updated 3 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆17Updated 2 years ago
- A C++ pytorch implementation of MuZero☆30Updated 4 months ago
- WIP implementation of https://arxiv.org/pdf/1901.08162.pdf☆9Updated 4 years ago
- ☆26Updated last year
- Implementation of GAIL and AIRL using chinerrl☆16Updated 2 years ago
- ☆30Updated 4 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆37Updated 3 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Updated 5 years ago
- DeeCamp 2019 最佳团队 斗地主出牌引擎☆14Updated 3 years ago
- ☆36Updated last year
- The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.☆47Updated last year
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆19Updated last year
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow☆19Updated 2 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆32Updated 3 years ago
- Deep Reinforcement Learning for Multi Agent Soccer☆17Updated 7 years ago
- PyTorch implementation of QR-DQN: Distributional Reinforcement Learning with Quantile Regression☆25Updated 4 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆21Updated last year
- Combining Evolutionary Algorithms and deep Reinforcement Learning☆14Updated 6 years ago
- ☆21Updated 2 years ago
- ☆29Updated last year
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆77Updated last year