A framework for pitting LLMs against each other in an evolving library of games ⚔
☆35Apr 17, 2025Updated last year
Alternatives and similar repositories for ZeroSumEval
Users that are interested in ZeroSumEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Documentation for DSPy Library☆24Updated this week
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆28Mar 6, 2024Updated 2 years ago
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX☆46Updated this week
- ☆11Dec 11, 2024Updated last year
- 🐇A rabbit-fast Rust reimplementation inspired by Claude Code, with native TUI, deeper tooling, and a cleaner path for terminal-first AI …☆43Apr 9, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Environments by the Prime Intellect Research Team☆57Updated this week
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆45Feb 15, 2024Updated 2 years ago
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 11 months ago
- Programmable chat templates for LLM training and inference.☆109Updated this week
- A CLI tool you can pipe code and then ask for changes, add documentation, etc, using the OpenAI API.☆13Jan 5, 2024Updated 2 years ago
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- ☆13Feb 20, 2020Updated 6 years ago
- CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models☆10Aug 4, 2022Updated 3 years ago
- ☆18Sep 21, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Repository for paper Decrypting Cryptic Crosswords☆11Jan 15, 2022Updated 4 years ago
- moodist☆28Apr 23, 2026Updated last month
- Conversations with Search Engines☆14Jun 12, 2023Updated 2 years ago
- LLM code editor for backend services☆16Oct 19, 2024Updated last year
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆34Nov 12, 2024Updated last year
- DSPY on action with OpenSource LLMs.☆107Apr 9, 2024Updated 2 years ago
- nyc is so back☆21Jun 27, 2025Updated 11 months ago
- The dataset and code for PeerSum at EMNLP'23.☆16Oct 20, 2025Updated 7 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆452Feb 13, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆65May 8, 2025Updated last year
- A collection of implementations of fair ML algorithms☆12Jan 7, 2018Updated 8 years ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆22Mar 11, 2024Updated 2 years ago
- 这是对基于大模型的多智能体系统论文的总结☆10Jun 23, 2024Updated last year
- Efficient vector database for hundred millions of embeddings.☆215May 17, 2024Updated 2 years ago
- a Python library that uses Reinforcement Learning (RL) to train LLMs.☆43Apr 15, 2026Updated last month
- ☆13Jan 27, 2019Updated 7 years ago
- ☆17Feb 12, 2025Updated last year
- ☆20Mar 22, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆16Jan 3, 2023Updated 3 years ago
- ☆27Nov 19, 2025Updated 6 months ago
- Iterative specification refinement tool: feeds your docs through GPT Pro Extended Reasoning via Oracle for multiple revision rounds until…☆59Mar 22, 2026Updated 2 months ago
- An IDE for AI coding☆30Updated this week
- Simple GRPO scripts and configurations.☆59Feb 6, 2025Updated last year
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources"☆21Feb 5, 2021Updated 5 years ago
- Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval (NeurIPS'21)☆48Dec 27, 2021Updated 4 years ago