GoodStartLabs / AI_DiplomacyLinks
Frontier Models playing the board game Diplomacy.
☆611Updated this week
Alternatives and similar repositories for AI_Diplomacy
Users that are interested in AI_Diplomacy are comparing it to the libraries listed below
Sorting:
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆780Updated this week
- Testing baseline LLMs performance across various models☆330Updated last month
- Async RL Training at Scale☆976Updated this week
- ☆543Updated 6 months ago
- ComplexTensor: Machine Learning By Bridging Classical and Quantum Computation☆79Updated last year
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆314Updated 6 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆454Updated last year
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,774Updated 4 months ago
- System 2 Reasoning Link Collection☆864Updated 9 months ago
- AlphaGo Moment for Model Architecture Discovery.☆1,127Updated last month
- This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.☆1,085Updated 2 weeks ago
- ☆482Updated 5 months ago
- Thermodynamic Hypergraphical Model Library in JAX☆974Updated last month
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆126Updated this week
- open source interpretability platform 🧠☆611Updated this week
- Build your own visual reasoning model☆415Updated last month
- Curated collection of community environments☆196Updated last week
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆829Updated 5 months ago
- ☆596Updated 7 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,180Updated 11 months ago
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆776Updated 2 weeks ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆108Updated 10 months ago
- Distributed Training Over-The-Internet☆973Updated 2 months ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,290Updated 2 weeks ago
- Inference-time scaling for LLMs-as-a-judge.☆320Updated 2 months ago
- Gradient descent is cool and all, but what if we could delete it?☆104Updated 4 months ago
- An interface library for RL post training with environments.☆944Updated last week
- Provider-agnostic, open-source evaluation infrastructure for language models☆699Updated last week
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆345Updated last year
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,687Updated last week