AlignmentResearch / go_attack
☆85Updated 3 months ago
Alternatives and similar repositories for go_attack:
Users that are interested in go_attack are comparing it to the libraries listed below
- Sandbox for playing with neural nets for Go☆75Updated 6 years ago
- AlphaZero in JAX☆77Updated last year
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆118Updated last year
- Play chess against large language models.☆46Updated last year
- ☆31Updated 2 years ago
- Code for the paper "Understanding RL Vision"☆47Updated 2 years ago
- ☆14Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆128Updated last year
- Scaling scaling laws with board games.☆48Updated last year
- MiniZero: An AlphaZero and MuZero Training Framework☆89Updated 2 months ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated 2 years ago
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"☆24Updated last month
- SAI: a fork of Leela Zero with variable komi.☆107Updated last year
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆118Updated 4 years ago
- Leela - a Go program combining Monte Carlo simulations and Neural Networks.☆68Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆111Updated 8 months ago
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆125Updated 6 months ago
- Artificial go player based on reinforcement and supervised learning☆48Updated 2 years ago
- Efficient baselines for autocurricula in JAX.☆187Updated 8 months ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆157Updated 4 years ago
- ☆51Updated 2 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆73Updated 4 months ago
- Graphical user interface for the game of Go, and other similar board games☆82Updated last month
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- Website for MiniGo, Leela-Zero and other Go data☆20Updated 4 years ago
- Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon.☆52Updated 3 weeks ago
- A networking protocol for agent-environment communication☆102Updated 2 months ago
- fast + parallel AlphaZero in JAX☆95Updated 4 months ago
- ☆12Updated 3 months ago