evg-tyurin / alpha-nagibator

Implementation of self-play based reinforcement learning for Checkers based on the AlphaGo Zero methods.

☆18

Alternatives and similar repositories for alpha-nagibator

Users who are interested in alpha-nagibator compare it to the libraries listed below.

tuero / muzero-cpp
A C++ PyTorch implementation of MuZero
☆38 Updated last year
michalsustr / spielviz
SpielViz is an interactive viewer for OpenSpiel games.
☆32 Updated last year
NTT123 / a0-jax
AlphaZero in JAX
☆77 Updated last year
kevaday / alphazero-general
A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch.
☆76 Updated 5 months ago
hegde95 / Agents_that_Listen
Train an agent to play VizDoom with multi-sensory inputs. Trained using Sample Factory.
☆14 Updated 3 years ago
kuprel / min-dalle-flax
This contains the Flax model of min(DALL·E) and code for converting it to PyTorch
☆45 Updated 2 years ago
petosa / multiplayer-alphazero
PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]
☆34 Updated 4 years ago
Zeta36 / connect4-alpha-zero
Connect4 reinforcement learning by AlphaGo Zero methods.
☆113 Updated 4 years ago
blanyal / alpha-zero
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste…
☆90 Updated 7 years ago
Miffyli / rl-human-prior-tricks
Evaluating different engineering tricks that make RL work
☆15 Updated 4 years ago
rystrauss / dopamax
Reinforcement learning in pure JAX.
☆13 Updated 3 months ago
bhansconnect / fast-alphazero-general
A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general
☆44 Updated 2 years ago
chiamp / muzero-cartpole
Applying DeepMind's MuZero algorithm to the cart pole environment in gym
☆21 Updated 2 years ago
JoshVarty / AlphaZeroSimple
The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with
☆215 Updated 2 years ago
wassname / world-models-sonic-pytorch
Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…
☆32 Updated 6 years ago
sirmammingtonham / alphastone
Using self-play, MCTS, and a deep neural network to create a Hearthstone AI player
☆29 Updated 6 years ago
kenjyoung / mctx_learning_demo
☆51 Updated 2 years ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆56 Updated 2 years ago
attentionneuron / attentionneuron.github.io
☆26 Updated 2 years ago
schrum2 / GameGAN
Interactive GAN evolution of Mario and Zelda levels.
☆54 Updated last year
nsavinov / gym-vizdoom
Gym wrapper for VizDoom environments
☆12 Updated 6 years ago
distillpub / post--understanding-rl-vision
Understanding RL vision Distill article
☆23 Updated 2 years ago
nicofirst1 / rl_werewolf
RL environment replicating the werewolf game to study emergent communication
☆19 Updated 2 years ago
cswinter / DeepCodeCraft
Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.
☆21 Updated 2 years ago
NTT123 / pax
A stateful pytree library for training neural networks.
☆22 Updated 2 years ago
schmidtdominik / Rainbow
Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …
☆45 Updated 3 years ago
plkmo / AlphaZero_Connect4
PyTorch implementation of AlphaZero Connect from scratch (with results)
☆81 Updated 5 years ago
MathieuRita / Lazimpa
Code for the paper LazImpa: Lazy and Impatient neural agents learn to communicate efficiently. Mathieu Rita, Rahma Chaabouni and Emmanuel…
☆16 Updated 4 years ago
Egiob / cfrx
cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) in JAX.
☆31 Updated 10 months ago
ColinQiyangLi / AdaCat
AdaCat
☆49 Updated 2 years ago