A framework for pitting LLMs against each other in an evolving library of games ⚔
☆35Apr 17, 2025Updated 11 months ago
Alternatives and similar repositories for ZeroSumEval
Users that are interested in ZeroSumEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Documentation for DSPy Library☆23Mar 27, 2026Updated 2 weeks ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Apr 20, 2025Updated 11 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆27Mar 6, 2024Updated 2 years ago
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX☆45Mar 20, 2026Updated 3 weeks ago
- ☆11Dec 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1☆24Oct 13, 2025Updated 5 months ago
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 9 months ago
- slowly building a set of infinite riddle generators for data-hungry methods☆14Nov 15, 2022Updated 3 years ago
- A CLI tool you can pipe code and then ask for changes, add documentation, etc, using the OpenAI API.☆13Jan 5, 2024Updated 2 years ago
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models☆10Aug 4, 2022Updated 3 years ago
- ☆18Sep 21, 2023Updated 2 years ago
- Repository for paper Decrypting Cryptic Crosswords☆10Jan 15, 2022Updated 4 years ago
- moodist☆25Apr 3, 2026Updated last week
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- prompt engineering experiments with DSPy GEPA and TextGrad☆70Sep 2, 2025Updated 7 months ago
- LLM code editor for backend services☆16Oct 19, 2024Updated last year
- Abstraction and Reasoning Corpus☆14Nov 22, 2022Updated 3 years ago
- ☆15Dec 15, 2025Updated 3 months ago
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆34Nov 12, 2024Updated last year
- DSPY on action with OpenSource LLMs.☆105Apr 9, 2024Updated 2 years ago
- nyc is so back☆21Jun 27, 2025Updated 9 months ago
- The dataset and code for PeerSum at EMNLP'23.☆16Oct 20, 2025Updated 5 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆449Feb 13, 2024Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆63May 8, 2025Updated 11 months ago
- A collection of implementations of fair ML algorithms☆12Jan 7, 2018Updated 8 years ago
- run deepseek v3 on a single node. Drops unused experts from memory.☆16Jan 26, 2025Updated last year
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆22Mar 11, 2024Updated 2 years ago
- Efficient vector database for hundred millions of embeddings.☆215May 17, 2024Updated last year
- ☆13Jan 27, 2019Updated 7 years ago
- ☆17Feb 12, 2025Updated last year
- ☆20Mar 22, 2024Updated 2 years ago
- A Bert2Bert model which able to generate headlines!☆12Nov 16, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆16Jan 3, 2023Updated 3 years ago
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources"☆20Feb 5, 2021Updated 5 years ago
- Simple GRPO scripts and configurations.☆58Feb 6, 2025Updated last year
- An implementation of the Anthropic's paper and essay on "A statistical approach to model evaluations"☆17Oct 6, 2025Updated 6 months ago
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆132Oct 16, 2024Updated last year
- ☆29Jan 31, 2026Updated 2 months ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆15Nov 4, 2024Updated last year