A framework for pitting LLMs against each other in an evolving library of games ⚔
☆34Apr 20, 2025Updated 10 months ago
Alternatives and similar repositories for ZeroSumEval
Users that are interested in ZeroSumEval are comparing it to the libraries listed below
Sorting:
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated last year
- Open source static analysis toolkit for LLM agent plans☆13Aug 9, 2025Updated 6 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- An advanced research assistant that utilizes AI agents to generate novel research directions and analyze scientific literature. This plat…☆16Feb 26, 2025Updated last year
- ☆18Apr 18, 2025Updated 10 months ago
- [SIGIR'25 Short] Official Repository of "KGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking"☆29Dec 17, 2025Updated 2 months ago
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆31Mar 20, 2025Updated 11 months ago
- Testing LLM reasoning abilities with lineage relationship quizzes.☆36Feb 2, 2026Updated last month
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆31Jul 11, 2025Updated 7 months ago
- AuraMatrix is personality analysis web which using llm to do evaluation. I have made this for Gyanotsav-2025 to show different ways to ut…☆11Dec 22, 2025Updated 2 months ago
- benchmarks for LLM tokenizers☆17Updated this week
- An experimental framework that democratizes access to distributed serverless compute.☆25Feb 25, 2026Updated last week
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed.☆14Nov 11, 2025Updated 3 months ago
- Generate Your Own Private Morning Radio for Commute☆32Feb 5, 2025Updated last year
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆34Mar 26, 2024Updated last year
- VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models☆36Apr 9, 2025Updated 10 months ago
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models☆39Jun 14, 2025Updated 8 months ago
- Agentic data transformation on infinite amounts of data☆142Jan 31, 2026Updated last month
- ☆27Updated this week
- Physics-Informed Neural Networks for Cardiovascular Blood Flow Simulations☆19Apr 7, 2025Updated 10 months ago
- ESG Insights AI simplifies ESG data analysis with advanced AI models, ensuring compliance with GRI standards. It helps asset managers ass…☆13Oct 31, 2024Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31May 1, 2025Updated 10 months ago
- MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces☆10Mar 24, 2025Updated 11 months ago
- A frontend interface for interacting with AI Models. Compatible with Ollama and OpenAI☆10May 1, 2025Updated 10 months ago
- ☆10Apr 26, 2023Updated 2 years ago
- CoachLint is your AI coding coach. It guides you through errors instead of just solving them for you.☆23Nov 20, 2025Updated 3 months ago
- VibEx (vx) is a developer-friendly CLI tool that streamlines the process of working with AI coding assistants. It helps developers prepar…☆28May 17, 2025Updated 9 months ago
- A comprehensive Python and R-based toolkit for clustering and sorting electrophysiology data recorded using Intan RHD2132 chips. Original…☆10Updated this week
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆13Dec 8, 2025Updated 2 months ago
- A Python package for amino acid sequence analysis. Proforma 2.1 complicant.☆14Feb 6, 2026Updated 3 weeks ago
- ☆10Nov 17, 2022Updated 3 years ago
- Emphasizes AI-based projects for various companies.☆15Apr 1, 2025Updated 11 months ago
- A Discord bot to retrieve Shopify Orders and Statistics☆10Dec 9, 2025Updated 2 months ago
- React Native, Right Now (rn-rn)☆18Sep 2, 2025Updated 6 months ago
- Reinforcement Learning (PPO) applied to a multiplayer simple card game (Witches)☆10Jun 7, 2020Updated 5 years ago
- Implementation of UniSpec, a deep learning model for predicting full fragment ion peptide spectra.☆12Feb 11, 2025Updated last year
- Shakey OS Mobile AI Framework for React Native allowing people to build React Native apps for IOS and Android with AI tooling and wallet …☆28Feb 3, 2025Updated last year
- "Open-source toolkit (Python Library, Registry API, CLI) for secure, decentralized AI agent interoperability using A2A/MCP."☆14May 10, 2025Updated 9 months ago
- ☆12Aug 1, 2025Updated 7 months ago