AlignmentResearch / go_attackLinks
☆86Updated 5 months ago
Alternatives and similar repositories for go_attack
Users that are interested in go_attack are comparing it to the libraries listed below
Sorting:
- AlphaZero in JAX☆77Updated last year
- Play chess against large language models.☆47Updated last year
- Sandbox for playing with neural nets for Go☆76Updated 6 years ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆123Updated last year
- Code for the paper "Understanding RL Vision"☆48Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆158Updated 4 years ago
- Graphical user interface for the game of Go, and other similar board games☆85Updated 3 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆113Updated 10 months ago
- 21.1 million Go games, 18k-9p☆129Updated 5 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆120Updated 4 years ago
- SAI: a fork of Leela Zero with variable komi.☆108Updated last year
- An environment of the board game Go using OpenAI's Gym API☆173Updated 3 years ago
- Leela Master weight is training from leela zero self-play sgf and human sgf file☆51Updated 5 years ago
- Website for MiniGo, Leela-Zero and other Go data☆20Updated 4 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆94Updated 4 months ago
- Artificial go player based on reinforcement and supervised learning☆48Updated 2 years ago
- Scaling scaling laws with board games.☆49Updated last year
- Leela - a Go program combining Monte Carlo simulations and Neural Networks.☆68Updated 2 years ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated 2 years ago
- fast + parallel AlphaZero in JAX☆97Updated 6 months ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"☆172Updated 2 years ago
- ☆73Updated 2 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆129Updated last year
- ☆31Updated 2 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆83Updated 2 years ago
- ☆34Updated 2 years ago
- Find best-response to a fixed policy in multi-agent RL☆287Updated 3 years ago
- Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon.☆64Updated 3 months ago
- Web application where humans can play Overcooked with AI agents.☆58Updated 2 years ago