AlignmentResearch / go_attack
☆84Updated last month
Alternatives and similar repositories for go_attack:
Users that are interested in go_attack are comparing it to the libraries listed below
- AlphaZero in JAX☆74Updated 10 months ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆107Updated last year
- MiniZero: An AlphaZero and MuZero Training Framework☆79Updated 2 months ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆127Updated last year
- An implementation of MuZero in JAX.☆54Updated 2 years ago
- Code for the paper "Understanding RL Vision"☆46Updated last year
- Scaling scaling laws with board games.☆47Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 6 months ago
- Efficient baselines for autocurricula in JAX.☆179Updated 5 months ago
- ☆13Updated 2 years ago
- ☆14Updated 10 months ago
- For code etc relating to the network training process.☆18Updated last month
- Baselines for Neural MMO -- new users should treat this repo as a starter project☆46Updated 6 months ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆115Updated 3 years ago
- Levin tree search guided by both a policy and a heuristic function☆17Updated last year
- ☆49Updated last year
- ☆31Updated 2 years ago
- Artificial go player based on reinforcement and supervised learning☆47Updated last year
- General Modules for JAX☆63Updated 6 months ago
- fast + parallel AlphaZero in JAX☆92Updated 2 months ago
- Interpreting how transformers simulate agents performing RL tasks☆77Updated last year
- Collection of RL Environments built using Madrona☆28Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 3 years ago
- Sandbox for playing with neural nets for Go☆76Updated 5 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆156Updated 3 years ago
- Code for the paper "Batch size invariance for policy optimization"☆47Updated last year
- Enable moe for nanogpt.☆22Updated last year
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆32Updated last year
- A tool to automate installing Atari ROMs for the Arcade Learning Environment☆78Updated last year