A framework for pitting LLMs against each other in an evolving library of games ⚔
☆35Apr 17, 2025Updated 10 months ago
Alternatives and similar repositories for ZeroSumEval
Users that are interested in ZeroSumEval are comparing it to the libraries listed below
Sorting:
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆27Mar 6, 2024Updated last year
- Official Documentation for DSPy Library☆21Updated this week
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX☆44Feb 5, 2026Updated 3 weeks ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆22Mar 11, 2024Updated last year
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- Efficient vector database for hundred millions of embeddings.☆212May 17, 2024Updated last year
- DSPY on action with OpenSource LLMs.☆104Apr 9, 2024Updated last year
- ☆27Oct 30, 2023Updated 2 years ago
- prompt engineering experiments with DSPy GEPA and TextGrad☆67Sep 2, 2025Updated 5 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆74Nov 4, 2025Updated 3 months ago
- ☆63Dec 21, 2024Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆449Feb 13, 2024Updated 2 years ago
- A framework for optimizing DSPy programs with RL☆318Jan 12, 2026Updated last month
- ☆29Oct 24, 2025Updated 4 months ago
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆132Oct 16, 2024Updated last year
- A programming language for formal/informal computation.☆43Dec 31, 2025Updated 2 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆45Feb 15, 2024Updated 2 years ago
- A passion project on my favorite e-commerce site that scrapes product data and builds a recommendation engine☆10May 2, 2023Updated 2 years ago
- openASO is a project designed to identify regulatory regions of an RNA that can be targeted by antisense oligonucleotides.☆10Sep 30, 2021Updated 4 years ago
- Simple Graph Memory for AI applications☆90Updated this week
- ☆27Updated this week
- Go SDK for the Bare Metal Cloud API☆14Dec 20, 2025Updated 2 months ago
- TOON as DSPy adapter☆25Feb 1, 2026Updated last month
- ☆11Dec 23, 2023Updated 2 years ago
- ☆17Jun 8, 2025Updated 8 months ago
- Implementation of a Systolic Array based sorting engine on an FPGA using Verilog☆11May 11, 2017Updated 8 years ago
- ☆12Dec 26, 2023Updated 2 years ago
- lncRNA-Py is a development package for applying machine learning and deep learning to the problem of lncRNA classification, i.e. predicti…☆12Jan 24, 2025Updated last year
- Modern Methods of Applied Statistics (Spring 2023) STAT 34800☆10May 20, 2023Updated 2 years ago
- GALL.AI (prev. Generall.AI) - Telegram Advanced AI Agent System Chat Bot☆14Feb 7, 2026Updated 3 weeks ago
- ☆13Nov 5, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval (NeurIPS'21)☆48Dec 27, 2021Updated 4 years ago
- ☆25Jan 30, 2026Updated last month
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- ☆12Jan 11, 2026Updated last month
- This software uses a config file (config.py), which is a settings file, to build and run SWAT+ models. Users can share the config along w…☆16Mar 9, 2022Updated 3 years ago
- Optimization in python☆11Oct 5, 2018Updated 7 years ago