Open-source backend for Martian's LLM Inference Provider Leaderboard
☆21Aug 13, 2024Updated last year
Alternatives and similar repositories for leaderboard-backend
Users that are interested in leaderboard-backend are comparing it to the libraries listed below
- Benchmarks to capture important workloads.☆32Updated this week
- ☆36Feb 6, 2026Updated last month
- LLM Serving Performance Evaluation Harness☆83Feb 25, 2025Updated last year
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- A repository aimed at sharing links to climate-related resources.☆12Updated this week
- R package providing a data frame interface for EPA's air dispersion model AERMOD☆20Jun 21, 2023Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- ☆18Updated this week
- ☆15Oct 24, 2023Updated 2 years ago
- Python wrapper for the energy system optimization framework IESopt.☆18Mar 2, 2026Updated last week
- Production code for the Fast Implementation of the Gaussian Puff Forward Atmospheric Model☆17Dec 1, 2025Updated 3 months ago
- This project defines a json ontology standard describing a power consumption measure in a given software/hardware context, noticeably in …☆15Mar 2, 2026Updated last week
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- Official Implementation of "The Graph Database Interface: Scaling Online Transactional and Analytical Graph Workloads to Hundreds of Thou…☆14Jul 2, 2025Updated 8 months ago
- Code and Data for GlitchBench☆13Feb 27, 2024Updated 2 years ago
- ☆11Oct 15, 2022Updated 3 years ago
- Collatinus Python Lemmatizer☆10Jun 1, 2021Updated 4 years ago
- ☆11Nov 5, 2024Updated last year
- ☆12Nov 5, 2024Updated last year
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated 2 years ago
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop …☆22Oct 12, 2023Updated 2 years ago
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks)☆10Jan 11, 2024Updated 2 years ago
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- Chinese financial LLM evaluation benchmark: six major categories, twenty-five tasks, tiered grading; domestic models earn an A grade☆10May 6, 2024Updated last year
- ☆12Mar 5, 2025Updated last year
- ☆14Nov 12, 2025Updated 3 months ago
- Resources used by all of the autometrics implementations☆14Dec 5, 2023Updated 2 years ago
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- Data used in Climate Indicator Project figures and tables☆15Jun 26, 2025Updated 8 months ago
- ☆11Apr 17, 2023Updated 2 years ago
- The official Python library for Openlayer, the Continuous Model Improvement Platform for AI. 📈☆16Updated this week
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- MLOps community survey☆10Dec 19, 2022Updated 3 years ago
- Survey of available speech datasets for Polish ASR development☆17Jan 1, 2025Updated last year
- Various test models in WNNX format. They can be viewed with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- ☆11Jan 3, 2024Updated 2 years ago
- ☆11Dec 20, 2023Updated 2 years ago
- 🐣🕐📅 A simple utility to draft scheduling emails.☆12Sep 13, 2023Updated 2 years ago
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago