Genius-Society / SnakeAI
Deep reinforcement learning applied to the game Snake. The algorithm is PPO for discrete action spaces, where it performs as well as it does in continuous action spaces. About half an hour of training is enough for the snake to play as well as you do.
☆22Updated 3 weeks ago
Alternatives and similar repositories for SnakeAI
Users who are interested in SnakeAI are comparing it to the repositories listed below
- This project is a PyTorch implementation that uses a deep CNN to recognize multi-digit numbers from the SVHN dataset derived from Google S…☆15Updated 3 weeks ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆123Updated 4 years ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆28Updated 2 weeks ago
- 🍄Reinforcement Learning: Super Mario Bros with dueling dqn🍄☆118Updated 2 months ago
- Multiagent Reinforcement Learning Research Project☆205Updated 7 months ago
- DSAC; Distributional Soft Actor-Critic☆125Updated 3 months ago
- PyTorch implementations of MADDPG, MAPPO (coming)☆141Updated last year
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆130Updated last year
- Deep Reinforcement Learning using Proximal Policy Optimization and Random Network Distillation in TensorFlow 2 and PyTorch with some e…☆52Updated 4 years ago
- Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and others☆196Updated 2 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆61Updated last year
- ☆42Updated 3 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆117Updated 2 years ago
- Transformers (GTrXL & CoBERL) applied to RL tasks☆27Updated 2 years ago
- A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm☆342Updated 4 years ago
- Code for "Hands-on Reinforcement Learning" (动手学强化学习)☆56Updated last year
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆155Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆170Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL☆180Updated 8 months ago
- Chinese edition of the OpenAI team's deep reinforcement learning tutorial☆29Updated 5 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆343Updated last month
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark☆452Updated 2 months ago
- Air combat confrontation based on reinforcement learning☆68Updated 3 years ago
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆34Updated 2 years ago
- A port of a MADDPG implementation to the environment provided by OpenAI (multiagent-particle-envs).☆19Updated 7 years ago
- PyTorch implementations of multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Gr…☆222Updated last year
- DQN to play Atari Pong☆115Updated 6 years ago
- UAV Logistics Environment for Multi-Agent Reinforcement Learning / Unity ML-Agents / Unity 3D☆95Updated last year