AlignmentResearch / go_attack
☆91 Updated 11 months ago
Alternatives and similar repositories for go_attack
Users who are interested in go_attack are comparing it to the libraries listed below.
- AlphaZero in JAX ☆81 Updated last year
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework ☆116 Updated 5 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback ☆134 Updated 2 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch. ☆84 Updated last year
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling ☆129 Updated 2 years ago
- Scaling scaling laws with board games. ☆54 Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆47 Updated 4 years ago
- fast + parallel AlphaZero in JAX ☆108 Updated last year
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection ☆46 Updated 2 years ago
- ☆15 Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆120 Updated last year
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆123 Updated 4 years ago
- Efficient baselines for autocurricula in JAX. ☆201 Updated last year
- A web based platform for collecting human actions in reinforcement learning environments ☆31 Updated 3 months ago
- Play chess against large language models. ☆49 Updated 3 months ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆85 Updated 3 years ago
- ☆53 Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆166 Updated 4 years ago
- ☆31 Updated 3 years ago
- Official repository of the spotlight ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent. ☆132 Updated 2 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆34 Updated 6 months ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?" ☆94 Updated 2 years ago
- An implementation of MuZero in JAX. ☆58 Updated 3 years ago
- SpielViz is an interactive viewer for OpenSpiel games. ☆37 Updated last year
- ☆60 Updated last year
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games ☆163 Updated last year
- ☆15 Updated 9 years ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆115 Updated last year
- Web application where humans can play Overcooked with AI agents. ☆60 Updated 3 years ago
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆57 Updated 10 months ago