Genius-Society / SnakeAILinks
Using deep reinforcement learning to play Snake game. The used algorithm is PPO for discrete! It has the brilliant performance in the field of discrete action space just like in continuous action space. You just need half an hour to train the snake and then it can be as smart as you.|使用深度强化学习玩蛇游戏。 使用的算法是离散的 PPO! 它在离散动作空间领域有着与连续动作空间一样的出色表现。
☆24Updated last month
Alternatives and similar repositories for SnakeAI
Users that are interested in SnakeAI are comparing it to the libraries listed below
Sorting:
- This project is a PyTorch implementation that uses deep CNN to recognize multi-digit numbers using the SVHN dataset derived from Google S…☆17Updated last month
- 一个针对中文聊天机器人的公开数据集☆10Updated 5 years ago
- Multiagent Reinforcement Learning Research Project☆210Updated 7 months ago
- Solve BipedalWalkerHardcore-v2 with TD3☆89Updated 2 years ago
- 动手学强化学习代码☆57Updated last year
- A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm☆345Updated 4 years ago
- PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.☆483Updated last year
- ☆63Updated 3 weeks ago
- ☆166Updated last year
- Collection of Reinforcement Learning / Meta Reinforcement Learning Environments.☆291Updated 10 months ago
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆108Updated 2 months ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆360Updated last week
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆57Updated 3 years ago
- A collection of offline reinforcement learning algorithms.☆185Updated 6 months ago
- ☆407Updated last year
- My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0☆30Updated 3 years ago
- Example code for the Gym documentation☆72Updated last year
- The implement of the policy gradient RL algorithm with pytorch☆38Updated 4 years ago
- The code for maddpg using pytorch☆168Updated 4 years ago
- 强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy based)的代码,代码都经过调试并可以运行☆86Updated last year
- A dataset used for NLP tasks.☆9Updated 4 years ago
- This project is implementation code of AlphaStar☆200Updated last year
- Lightweight version of MAPPO to help you quickly migrate to your local environment.☆680Updated 3 months ago
- implementation of MADDPG using PettingZoo and PyTorch☆144Updated last year
- ☆204Updated 2 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆157Updated last year
- PyTorch implementations of MADDPG, MAPPO (coming)☆147Updated last year
- SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis☆30Updated 5 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆137Updated last year
- 🍄Reinforcement Learning: Super Mario Bros with dueling dqn🍄☆120Updated 2 weeks ago