A framework for pitting LLMs against each other in an evolving library of games ⚔
☆35 · Apr 17, 2025 · Updated last year
Alternatives and similar repositories for ZeroSumEval
Users that are interested in ZeroSumEval are comparing it to the libraries listed below.
- Official Documentation for DSPy Library ☆23 · Apr 24, 2026 · Updated last week
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆34 · Apr 20, 2025 · Updated last year
- Leverage Large Language Models to generate and execute code dynamically through an intuitive and easy-to-use API! ☆17 · Mar 2, 2024 · Updated 2 years ago
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX ☆45 · Mar 20, 2026 · Updated last month
- ☆11 · Dec 11, 2024 · Updated last year
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1 ☆24 · Oct 13, 2025 · Updated 6 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆45 · Feb 15, 2024 · Updated 2 years ago
- slowly building a set of infinite riddle generators for data-hungry methods ☆14 · Nov 15, 2022 · Updated 3 years ago
- A CLI tool you can pipe code and then ask for changes, add documentation, etc, using the OpenAI API. ☆13 · Jan 5, 2024 · Updated 2 years ago
- Approximating the joint distribution of language models via MCTS ☆22 · Nov 3, 2024 · Updated last year
- ☆18 · Sep 21, 2023 · Updated 2 years ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS… ☆19 · Nov 28, 2021 · Updated 4 years ago
- Repository for paper Decrypting Cryptic Crosswords ☆11 · Jan 15, 2022 · Updated 4 years ago
- moodist ☆27 · Apr 23, 2026 · Updated last week
- Conversations with Search Engines ☆14 · Jun 12, 2023 · Updated 2 years ago
- ☆15 · Dec 10, 2021 · Updated 4 years ago
- LLM code editor for backend services ☆16 · Oct 19, 2024 · Updated last year
- ☆15 · Dec 15, 2025 · Updated 4 months ago
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings ☆34 · Nov 12, 2024 · Updated last year
- ☆10 · Aug 26, 2022 · Updated 3 years ago
- DSPY on action with OpenSource LLMs. ☆106 · Apr 9, 2024 · Updated 2 years ago
- NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021 ☆13 · May 18, 2021 · Updated 4 years ago
- nyc is so back ☆21 · Jun 27, 2025 · Updated 10 months ago
- The dataset and code for PeerSum at EMNLP'23. ☆16 · Oct 20, 2025 · Updated 6 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆450 · Feb 13, 2024 · Updated 2 years ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆63 · May 8, 2025 · Updated 11 months ago
- A collection of implementations of fair ML algorithms ☆12 · Jan 7, 2018 · Updated 8 years ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection. ☆22 · Mar 11, 2024 · Updated 2 years ago
- Efficient vector database for hundred millions of embeddings. ☆215 · May 17, 2024 · Updated last year
- A summary of papers on LLM-based multi-agent systems ☆10 · Jun 23, 2024 · Updated last year
- ☆13 · Jan 27, 2019 · Updated 7 years ago
- ☆18 · Apr 26, 2021 · Updated 5 years ago
- ☆16 · Jan 3, 2023 · Updated 3 years ago
- An evaluation toolbox for machine learning explanations ☆16 · Jan 7, 2024 · Updated 2 years ago
- ☆26 · Nov 19, 2025 · Updated 5 months ago
- Iterative specification refinement tool: feeds your docs through GPT Pro Extended Reasoning via Oracle for multiple revision rounds until… ☆57 · Mar 22, 2026 · Updated last month
- Facilitates Visual Representation of Sign Language Data and Glosses ☆19 · May 16, 2025 · Updated 11 months ago
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources" ☆20 · Feb 5, 2021 · Updated 5 years ago
- Experimental studies of my paper "Sampling Techniques in Bayesian Target Encoding" ☆12 · Dec 8, 2022 · Updated 3 years ago