AlignmentResearch / go_attack
☆80Updated last month
Related projects ⓘ
Alternatives and complementary repositories for go_attack
- AlphaZero in JAX☆69Updated 7 months ago
- ☆48Updated last year
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆97Updated last year
- Artificial go player based on reinforcement and supervised learning☆47Updated last year
- MiniZero: An AlphaZero and MuZero Training Framework☆72Updated 3 weeks ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆118Updated last year
- AlphaZero based engine for the game of Go (圍棋/围棋).☆88Updated last week
- Sandbox for playing with neural nets for Go☆74Updated 5 years ago
- Computer go engine using Monte-Carlo Tree Search written in Python3.☆56Updated 6 months ago
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆79Updated 2 weeks ago
- An implementation of MuZero in JAX.☆53Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 3 years ago
- ☆12Updated 2 years ago
- Scaling scaling laws with board games.☆40Updated last year
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆86Updated last year
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14Updated 5 months ago
- SAI: a fork of Leela Zero with variable komi.☆106Updated last year
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆156Updated 3 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆110Updated 3 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆65Updated last year
- ☆24Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 2 months ago
- fast + parallel AlphaZero in JAX☆84Updated 7 months ago
- ☆65Updated 3 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆14Updated 9 months ago
- General Modules for JAX☆58Updated 3 months ago
- Efficient baselines for autocurricula in JAX.☆173Updated 2 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆84Updated last month
- A project that provides help for using DeepMind's mctx on gym-style environments.☆50Updated 6 months ago
- Graphical user interface for the game of Go, and other similar board games☆77Updated 11 months ago