GoodStartLabs / AI_DiplomacyLinks
Frontier Models playing the board game Diplomacy.
☆624Updated 3 weeks ago
Alternatives and similar repositories for AI_Diplomacy
Users that are interested in AI_Diplomacy are comparing it to the libraries listed below
Sorting:
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆843Updated this week
- Testing baseline LLMs performance across various models☆336Updated last week
- Super basic implementation (gist-like) of RLMs with REPL environments.☆535Updated 2 weeks ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆315Updated 7 months ago
- Async RL Training at Scale☆1,020Updated this week
- ☆482Updated 6 months ago
- open source interpretability platform 🧠☆656Updated this week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,318Updated last week
- Curated collection of community environments☆205Updated this week
- Provider-agnostic, open-source evaluation infrastructure for language models☆712Updated last month
- Thermodynamic Hypergraphical Model Library in JAX☆985Updated 2 months ago
- ☆552Updated 7 months ago
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆143Updated this week
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆457Updated last year
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆829Updated 5 months ago
- The history files when recording human interaction while solving ARC tasks☆117Updated this week
- Build your own visual reasoning model☆417Updated last week
- Claude Deep Research config for Claude Code.☆225Updated 10 months ago
- Inference-time scaling for LLMs-as-a-judge.☆326Updated 2 months ago
- ComplexTensor: Machine Learning By Bridging Classical and Quantum Computation☆78Updated last year
- An interface library for RL post training with environments.☆1,066Updated last week
- ☆615Updated 8 months ago
- Prompts used in the Automated Auditing Blog Post☆135Updated 6 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆514Updated last month
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆815Updated last week
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆345Updated last year
- ☆116Updated last week
- II-Researcher: a new open-source framework designed to aid building search / research agents☆492Updated 5 months ago
- On-device intelligence.☆393Updated 10 months ago
- AlphaGo Moment for Model Architecture Discovery.☆1,128Updated last month