AlignmentResearch / go_attack
☆89 · Updated 7 months ago
Alternatives and similar repositories for go_attack
Users who are interested in go_attack are comparing it to the repositories listed below
- AlphaZero in JAX ☆78 · Updated last year
- Scaling scaling laws with board games. ☆53 · Updated 2 years ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling ☆126 · Updated last year
- MiniZero: An AlphaZero and MuZero Training Framework ☆98 · Updated last month
- Intrinsic Motivation from Artificial Intelligence Feedback ☆131 · Updated last year
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection ☆45 · Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆46 · Updated 4 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆114 · Updated last year
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆79 · Updated 8 months ago
- Play chess against large language models. ☆48 · Updated last year
- ☆31 · Updated 3 years ago
- Efficient baselines for autocurricula in JAX. ☆196 · Updated last year
- ☆52 · Updated 2 years ago
- fast + parallel AlphaZero in JAX ☆96 · Updated 8 months ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆120 · Updated 4 years ago
- Genetic programming using LLMs ☆43 · Updated 5 months ago
- Code for the paper "Understanding RL Vision" ☆48 · Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆33 · Updated 3 years ago
- ☆14 · Updated last year
- An implementation of MuZero in JAX. ☆56 · Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments ☆31 · Updated 2 years ago
- Baselines for Neural MMO -- new users should treat this repo as a starter project ☆50 · Updated last year
- ☆67 · Updated 3 years ago
- ☆23 · Updated last year
- Code and links for over 25,000 trained Atari agents ☆98 · Updated last year
- Submissions for AI and Efficiency SOTA's ☆56 · Updated 5 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆84 · Updated 2 years ago
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games ☆151 · Updated 10 months ago
- The NetHack Learning Environment ☆87 · Updated 2 weeks ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it ☆128 · Updated 2 years ago