AlignmentResearch / go_attackLinks
☆86Updated 6 months ago
Alternatives and similar repositories for go_attack
Users that are interested in go_attack are comparing it to the libraries listed below
Sorting:
- AlphaZero in JAX☆78Updated last year
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated 2 years ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆124Updated last year
- ☆31Updated 2 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆94Updated this week
- Intrinsic Motivation from Artificial Intelligence Feedback☆129Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated 10 months ago
- Scaling scaling laws with board games.☆49Updated 2 years ago
- Play chess against large language models.☆47Updated last year
- Code for the paper "Understanding RL Vision"☆48Updated 2 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Learn online intrinsic rewards from LLM feedback☆41Updated 7 months ago
- ☆23Updated last year
- ☆34Updated 2 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆78Updated 7 months ago
- ☆52Updated 2 years ago
- fast + parallel AlphaZero in JAX☆97Updated 6 months ago
- Efficient baselines for autocurricula in JAX.☆190Updated 10 months ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆83Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆120Updated 4 years ago
- ☆56Updated last year
- ☆51Updated last year
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 3 years ago
- Sandbox for playing with neural nets for Go☆76Updated 6 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year
- Submissions for AI and Efficiency SOTA's☆56Updated 5 years ago
- Official repository of the spotlight ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent.☆69Updated last week
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago