EveryInc / AI_DiplomacyLinks
Frontier Models playing the board game Diplomacy.
☆494Updated this week
Alternatives and similar repositories for AI_Diplomacy
Users that are interested in AI_Diplomacy are comparing it to the libraries listed below
Sorting:
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆521Updated this week
- procedural reasoning datasets☆893Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,451Updated 2 months ago
- The Multilayer Perceptron Language Model☆553Updated 10 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆305Updated this week
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆312Updated 8 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆448Updated 9 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆802Updated 3 weeks ago
- System 2 Reasoning Link Collection☆839Updated 3 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆173Updated 11 months ago
- Scale your LLM-as-a-judge.☆245Updated this week
- ☆93Updated 8 months ago
- The Autograd Engine☆616Updated 9 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆482Updated last month
- ☆211Updated last week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆737Updated 3 weeks ago
- The Tensor (or Array)☆437Updated 10 months ago
- Minimal and annotated implementations of key ideas from modern deep learning research.☆884Updated 2 weeks ago
- explore token trajectory trees on instruct and base models☆127Updated last month
- ☆128Updated 6 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆405Updated this week
- Large Concept Models: Language modeling in a sentence representation space☆2,236Updated 5 months ago
- ☆1,408Updated 4 months ago
- Textbook on reinforcement learning from human feedback☆1,062Updated last week
- Verifiers for LLM Reinforcement Learning☆1,391Updated last week
- Build your own visual reasoning model☆390Updated last week
- ☆114Updated 6 months ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆108Updated 4 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆80Updated last month
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆883Updated 2 months ago