kenjyoung / Neurohex
☆13Updated 2 years ago
Alternatives and similar repositories for Neurohex:
Users that are interested in Neurohex are comparing it to the libraries listed below
- Hex board game AI with self-play learning based on the AlphaZero algorithm☆34Updated 5 years ago
- This package allows to use PLE as a gym environment.☆72Updated 4 years ago
- gui for board game hex (and Y) by broderick arneson☆13Updated last year
- Code for 'The Grand Atari Challenge dataset' paper☆52Updated 7 years ago
- ☆65Updated last year
- Skip Context Tree Switching - Reference Implementation☆49Updated 7 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 5 years ago
- A reinforcement learning framework☆154Updated 6 years ago
- Train self-modifying neural networks with neuromodulated plasticity☆76Updated 5 years ago
- some common TD Learning algorithms☆67Updated 5 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- PyTorch implementation of AlphaZero Connect from scratch (with results)☆81Updated 5 years ago
- Implementation of TD-Gammon in TensorFlow.☆111Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆93Updated 6 years ago
- Scaling scaling laws with board games.☆48Updated last year
- A simple implementation of MuZero algorithm for connect4 game☆97Updated 4 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆114Updated 7 months ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆49Updated 2 years ago
- ☆85Updated 4 years ago
- learning to play atari games with reinforcement learning☆10Updated 9 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- krazy grid world☆25Updated 5 years ago
- Framework for writing bots that play Hanabi.☆37Updated 5 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated 2 years ago
- An implementation of Deep Q-Network using Caffe☆69Updated 9 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Updated 6 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆69Updated 7 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆114Updated 3 years ago