facebookresearch/ZeroSumEval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/ZeroSumEval)

facebookresearch / ZeroSumEval

A framework for pitting LLMs against each other in an evolving library of games ⚔

☆35

Alternatives and similar repositories for ZeroSumEval

Users that are interested in ZeroSumEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZeroSumEval / ZeroSumEval
View on GitHub
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆35Apr 17, 2025Updated last year
aniket-work / AI_Powered_Dev_Search_Engine
View on GitHub
AI_Powered_Dev_Search_Engine
☆12Mar 10, 2024Updated 2 years ago
ARBML / Taqyim
View on GitHub
Python intefrace for evaluation on chatgpt models
☆19Feb 13, 2024Updated 2 years ago
calmstate / Itinerant
View on GitHub
A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.
☆19Aug 30, 2024Updated last year
UBC-NLP / peacock
View on GitHub
This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.
☆26Dec 9, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
justarter / E2URec
View on GitHub
Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…
☆38Jul 19, 2024Updated 2 years ago
hetailang / SqueezeAttention
View on GitHub
☆37Oct 10, 2024Updated last year
prs-eth / LoRA-Ensemble
View on GitHub
LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks
☆55Mar 7, 2026Updated 4 months ago
HaroldChen19 / VistaDPO
View on GitHub
[ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
☆41Jun 14, 2025Updated last year
shuzhangzhong / HybriMoE-Preview
View on GitHub
☆17Apr 9, 2025Updated last year
open-compass / Ada-LEval
View on GitHub
The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"
☆56May 22, 2025Updated last year
mwatkins1970 / SAE_Feature_Interpretability_Tool
View on GitHub
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…
☆19Oct 4, 2024Updated last year
YoungDubbyDu / LLM-based-Multi-Agent-Systems
View on GitHub
这是对基于大模型的多智能体系统论文的总结
☆10Jun 23, 2024Updated 2 years ago
hrwise-nlp / AppBench
View on GitHub
This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
☆16Nov 4, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
git-disl / Virus
View on GitHub
This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"
☆56Feb 2, 2025Updated last year
hustvl / MIM4D
View on GitHub
[IJCV 2025] MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning
☆78May 30, 2025Updated last year
aws-samples / amazon-isv-plug-n-play
View on GitHub
☆10Apr 26, 2023Updated 3 years ago
mznmel / Pico-Saudi-LLMs-Benchmark
View on GitHub
أسئلة باللغة العربية تركز على الثقافة السعودية تم اختبارها على عدد من النماذج اللغوية الضخمة LLMs
☆18Jan 22, 2025Updated last year
embeddings-benchmark / arena
View on GitHub
Code for the MTEB Arena
☆25Jul 2, 2025Updated last year
66RING / CritiPrefill
View on GitHub
Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".
☆17Sep 15, 2024Updated last year
smy20011 / MorningRadio
View on GitHub
Generate Your Own Private Morning Radio for Commute
☆33Feb 5, 2025Updated last year
SijiaCui / play-urts
View on GitHub
☆15Oct 28, 2024Updated last year
moucheng2017 / SOP-LVM-ICL-Ensemble
View on GitHub
[NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…
☆23Mar 16, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ianhohoho / auto-hyde
View on GitHub
🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…
☆38Mar 26, 2024Updated 2 years ago
HannahKirk / prism-alignment
View on GitHub
The Prism Alignment Project
☆93Apr 25, 2024Updated 2 years ago
PasiKoodaa / dia
View on GitHub
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆32May 1, 2025Updated last year
juyeonnn / KGMEL
View on GitHub
[SIGIR'25 Short] Official Repository of "KGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking"
☆29Dec 17, 2025Updated 7 months ago
brownirl / rlang
View on GitHub
A Declarative Language for Expressing Partial World Knowledge to Reinforcement Learning Agents
☆17Jan 19, 2024Updated 2 years ago
ghi-electronics / TinyCLR-Samples
View on GitHub
Sample projects and demos for TinyCLR OS
☆15Jul 17, 2026Updated last week
tianyi-lab / C3PO
View on GitHub
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆21Apr 9, 2025Updated last year
ddw2AIGROUP2CQUPT / PA-LLaVA
View on GitHub
A Large Language-Vision Assistant for Pathology Image Understanding (BIBM-2024 & Journal of Artificial Intelligence Review 2025)
☆65Jun 18, 2025Updated last year
VILA-Lab / DELT
View on GitHub
(CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA…
☆28Aug 23, 2025Updated 11 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
quackingduck / rld
View on GitHub
Commandline utility for OSX that reloads the frontmost browser tab
☆11Jan 18, 2016Updated 10 years ago
Babelscape / LLM-Oasis
View on GitHub
This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…
☆25Oct 15, 2025Updated 9 months ago
jalbrethsen / double-agent
View on GitHub
☆12Aug 1, 2025Updated 11 months ago
foxglove / ros1
View on GitHub
Standalone TypeScript implementation of the ROS 1 protocol with a pluggable transport layer
☆15Jan 8, 2025Updated last year
Brett-Kennedy / PRISM-Rules
View on GitHub
A rules induction system for data mining and exploratory data analysis
☆11Jul 17, 2024Updated 2 years ago
vlinx-io / infinite-search
View on GitHub
AI Search Engine Development with SpringBoot and Langchain4J, based on search-with-lepton project
☆41Feb 2, 2024Updated 2 years ago
ConiferLM / Conifer
View on GitHub
Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models
☆91Apr 4, 2024Updated 2 years ago