GoodStartLabs / AI_DiplomacyLinks
Frontier Models playing the board game Diplomacy.
☆577Updated 2 weeks ago
Alternatives and similar repositories for AI_Diplomacy
Users that are interested in AI_Diplomacy are comparing it to the libraries listed below
Sorting:
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆692Updated this week
- Testing baseline LLMs performance across various models☆307Updated last month
- Decentralized RL Training at Scale☆592Updated this week
- ☆417Updated 3 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆450Updated 11 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆313Updated 2 months ago
- ☆464Updated 3 months ago
- procedural reasoning datasets☆1,102Updated this week
- The history files when recording human interaction while solving ARC tasks☆115Updated this week
- ComplexTensor: Machine Learning By Bridging Classical and Quantum Computation☆77Updated 9 months ago
- Training-Ready RL Environments + Evals☆90Updated last week
- ☆165Updated 8 months ago
- System 2 Reasoning Link Collection☆852Updated 6 months ago
- ☆497Updated last month
- Build your own visual reasoning model☆408Updated 3 weeks ago
- large population models☆408Updated last week
- ☆477Updated 2 months ago
- The Multilayer Perceptron Language Model☆562Updated last year
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆322Updated 10 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆458Updated last month
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆813Updated last month
- The Autograd Engine☆630Updated last year
- ☆91Updated 11 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,145Updated 7 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆84Updated this week
- Provider-agnostic, open-source evaluation infrastructure for language models☆531Updated this week
- Inference-time scaling for LLMs-as-a-judge.☆297Updated 2 weeks ago
- ☆58Updated 2 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆557Updated last month
- Build hours code to share.☆546Updated 3 weeks ago