Alx-AI / AI_Diplomacy

☆219

Alternatives and similar repositories for AI_Diplomacy

Users that are interested in AI_Diplomacy are comparing it to the libraries listed below

Sorting:

EurekaLabsAI / mlp
The Multilayer Perceptron Language Model
☆548Updated 9 months ago
open-thought / reasoning-gym
procedural reasoning datasets
☆580Updated this week
EurekaLabsAI / tensor
The Tensor (or Array)
☆432Updated 9 months ago
NousResearch / atropos
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …
☆357Updated this week
open-thought / system-2-research
System 2 Reasoning Link Collection
☆833Updated 2 months ago
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆46Updated 2 months ago
groundlight / r1_vlm
Build your own visual reasoning model
☆362Updated this week
brendanhogan / DeepSeekRL-Extended
Exploring Applications of GRPO
☆212Updated last week
jerber / lang-jepa
☆111Updated 4 months ago
Laz4rz / GPT-2
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
☆174Updated 9 months ago
LeonGuertler / TextArena
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
☆156Updated this week
MekkCyber / TritonAcademy
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
☆180Updated this week
McGill-NLP / nano-aha-moment
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
☆450Updated this week
kmohan321 / Research_Papers
☆46Updated last month
YuvrajSingh-mist / Paper-Replications
A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch
☆194Updated 2 weeks ago
VachanVY / Reinforcement-Learning
PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research …
☆119Updated this week
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆98Updated 2 months ago
google-deepmind / mishax
☆129Updated last month
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆142Updated 2 months ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆136Updated 10 months ago
callummcdougall / ARENA_2.0
Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.
☆211Updated last year
ash-01xor / bpe.c
Simple Byte pair Encoding mechanism used for tokenization process . written purely in C
☆129Updated 6 months ago
PrimeIntellect-ai / prime-rl
prime-rl is a codebase for decentralized RL training at scale
☆211Updated this week
kvfrans / jax-diffusion-transformer
Implementation of Diffusion Transformer (DiT) in JAX
☆275Updated 11 months ago
rkinas / triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
☆345Updated 2 months ago
0xD4rky / Vision-Transformers
This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…
☆216Updated 4 months ago
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆154Updated last month
SWE-Gym / SWE-Gym
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆455Updated last week
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆109Updated 3 weeks ago
jerber / arc_agi
☆54Updated 3 months ago