Genius-Society / SnakeAILinks

Using deep reinforcement learning to play Snake game. The used algorithm is PPO for discrete! It has the brilliant performance in the field of discrete action space just like in continuous action space. You just need half an hour to train the snake and then it can be as smart as you.|使用深度强化学习玩蛇游戏。使用的算法是离散的 PPO！它在离散动作空间领域有着与连续动作空间一样的出色表现。

☆24

Alternatives and similar repositories for SnakeAI

Users that are interested in SnakeAI are comparing it to the libraries listed below

Sorting:

Genius-Society / svhn_recognition
This project is a PyTorch implementation that uses deep CNN to recognize multi-digit numbers using the SVHN dataset derived from Google S…
☆17Updated last month
lqhou / Chinese_ChatBot_DataSet
一个针对中文聊天机器人的公开数据集
☆10Updated 5 years ago
binary-husky / hmp2g
Multiagent Reinforcement Learning Research Project
☆210Updated 7 months ago
XinJingHao / TD3-BipedalWalkerHardcore-v2
Solve BipedalWalkerHardcore-v2 with TD3
☆89Updated 2 years ago
acezsq / dsx-rl
动手学强化学习代码
☆57Updated last year
philtabor / Multi-Agent-Deep-Deterministic-Policy-Gradients
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
☆345Updated 4 years ago
marlbenchmark / off-policy
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
☆483Updated last year
MyRepositories-hub / Simple-Policy-Optimization
☆63Updated 3 weeks ago
jidiai / ai_lib
☆166Updated last year
PaddlePaddle / MetaGym
Collection of Reinforcement Learning / Meta Reinforcement Learning Environments.
☆291Updated 10 months ago
Starlight0798 / gymRL
基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)
☆108Updated 2 months ago
Jingliang-Duan / DSAC-v2
DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic
☆360Updated last week
Quantum-Cheese / DeepReinforcementLearning_Pytorch
Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym
☆57Updated 3 years ago
polixir / OfflineRL
A collection of offline reinforcement learning algorithms.
☆185Updated 6 months ago
PKU-MARL / Multi-Agent-Transformer
☆407Updated last year
jxx123 / rl-tf2
My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0
☆30Updated 3 years ago
Farama-Foundation / gym-examples
Example code for the Gym documentation
☆72Updated last year
deligentfool / policy_based_RL
The implement of the policy gradient RL algorithm with pytorch
☆38Updated 4 years ago
DKuan / MADDPG_torch
The code for maddpg using pytorch
☆168Updated 4 years ago
Phoenix-Shen / ReinforcementLearning
强化学习算法库，包含了目前主流的强化学习算法(Value based and Policy based)的代码，代码都经过调试并可以运行
☆86Updated last year
beeevita / Classical-Chinese-NER-RE-Dataset
A dataset used for NLP tasks.
☆9Updated 4 years ago
kimbring2 / AlphaStar_Implementation
This project is implementation code of AlphaStar
☆200Updated last year
tinyzqh / light_mappo
Lightweight version of MAPPO to help you quickly migrate to your local environment.
☆680Updated 3 months ago
Git-123-Hub / maddpg-pettingzoo-pytorch
implementation of MADDPG using PettingZoo and PyTorch
☆144Updated last year
cyanrain7 / TRPO-in-MARL
☆204Updated 2 years ago
tjuHaoXiaotian / pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…
☆157Updated last year
philtabor / Multi-Agent-Reinforcement-Learning
PyTorch implementations of MADDPG, MAPPO (coming)
☆147Updated last year
DavidMouse1118 / Reinforcement-Learning-Maze-World
SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis
☆30Updated 5 years ago
jianzhnie / deep-marl-toolkit
MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…
☆137Updated last year
jiseongHAN / Super-Mario-RL
🍄Reinforcement Learning: Super Mario Bros with dueling dqn🍄
☆120Updated 2 weeks ago