Frontier Models playing the board game Diplomacy.
☆633Feb 11, 2026Updated 3 weeks ago
Alternatives and similar repositories for AI_Diplomacy
Users that are interested in AI_Diplomacy are comparing it to the libraries listed below
Sorting:
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,550Jan 12, 2025Updated last year
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,781Apr 18, 2025Updated 10 months ago
- Retail Search with AI☆14Feb 14, 2026Updated 2 weeks ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Feb 11, 2026Updated 2 weeks ago
- ☆67May 23, 2025Updated 9 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,099Aug 26, 2025Updated 6 months ago
- Minimal reproduction of DeepSeek R1-Zero☆12,853Updated this week
- NanoGPT (124M) in 2 minutes☆4,679Updated this week
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆335Nov 2, 2025Updated 4 months ago
- ☆20Feb 10, 2026Updated 3 weeks ago
- Implementation of Diffusion Transformer (DiT) in JAX☆306Jun 11, 2024Updated last year
- Bastien One is being developed as autonomous A.I. bot with the capacity to complete complex tasks - either by itself or by creating addit…☆16Mar 9, 2025Updated 11 months ago
- An introduction to LLM Sampling☆79Dec 15, 2024Updated last year
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated 10 months ago
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated last year
- ☆22Oct 27, 2025Updated 4 months ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 3 months ago
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆28Feb 13, 2026Updated 2 weeks ago
- Weekly Vibecast Live coding sessions with rUv. Check branches for each week.☆36Feb 22, 2026Updated last week
- train entropix like a champ!☆20Oct 10, 2024Updated last year
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and …☆13Apr 24, 2025Updated 10 months ago
- AutonomousSphere is an agentic collaboration server. Agents talk, act, and use tools like teammates. Federated servers form an internet o…☆16May 13, 2025Updated 9 months ago
- ☆12Jun 28, 2024Updated last year
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,042Apr 27, 2025Updated 10 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- Professional Wargaming LLM Toolbox☆20Jul 9, 2025Updated 7 months ago
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- Universal MCP IdP (Identity Provider) - Support Thousands of Integrations, Zero Maintenance☆30Dec 25, 2025Updated 2 months ago
- vyai – A lightweight CLI tool to interact with the Gemini API from the terminal.☆11Dec 8, 2025Updated 2 months ago
- AZ AI DevContainer: Prebuilt AI Developer DevContainer/Codespace Environment including Python, Jupyter, Infra as Code deployment, AI Foun…☆14Feb 11, 2026Updated 3 weeks ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆13May 28, 2025Updated 9 months ago
- The official Node JS SDK for Smallest.ai☆14Feb 18, 2026Updated 2 weeks ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Azure AI Visual Search toolkit☆15Oct 25, 2022Updated 3 years ago
- Developing a legal research tool leveraging ChatGPT / GPT-4☆14Mar 10, 2024Updated last year
- Simple MPI implementation for prototyping or learning☆303Aug 6, 2025Updated 6 months ago
- Exploring Applications of GRPO☆251Aug 25, 2025Updated 6 months ago
- Demo of ADK (Agent Development Kit) as an MCP (Model Context Protocol) client for flight search capabilities.☆33Jun 1, 2025Updated 9 months ago
- A PyTorch native platform for training generative AI models☆5,098Updated this week