AlignmentResearch / go_attack
☆80Updated last month
Related projects ⓘ
Alternatives and complementary repositories for go_attack
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆98Updated last year
- Play chess against large language models.☆38Updated 9 months ago
- AlphaZero in JAX☆69Updated 7 months ago
- Graphical user interface for the game of Go, and other similar board games☆77Updated last year
- Sandbox for playing with neural nets for Go☆74Updated 5 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆72Updated last month
- SAI: a fork of Leela Zero with variable komi.☆106Updated last year
- ☆13Updated 2 years ago
- Leela Master weight is training from leela zero self-play sgf and human sgf file☆51Updated 5 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆112Updated 3 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- Artificial go player based on reinforcement and supervised learning☆47Updated last year
- Website for MiniGo, Leela-Zero and other Go data☆20Updated 4 years ago
- AlphaZero based engine for the game of Go (圍棋/围棋).☆89Updated this week
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆66Updated last year
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆156Updated 3 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆63Updated 2 years ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆45Updated last year
- Leela - a Go program combining Monte Carlo simulations and Neural Networks.☆67Updated last year
- Implementation of Deepmind's AlphaZero algorithm with Caffe and C++☆19Updated 6 years ago
- For code etc relating to the network training process.☆15Updated this week
- Server side code of the Leela Zero project☆67Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 2 months ago
- Run a KataGo bot on OGS with Colab☆24Updated 2 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆42Updated last year
- An implementation of MuZero in JAX.☆53Updated 2 years ago
- Unofficial attempt to rebuild AlphaGo Zero☆56Updated 6 months ago
- ♟️ Vectorized RL game environments in JAX☆412Updated last week
- Chess environment for smaller chess variants, AlphaZero-like MCTS-learning, and Concept Detection☆14Updated last year
- Code for the paper "Understanding RL Vision"☆43Updated last year