fixie-ai / ai-benchmarks
Benchmarking suite for popular AI APIs
☆88 · Feb 6, 2025 · Updated last year
Alternatives and similar repositories for ai-benchmarks
Users who are interested in ai-benchmarks are comparing it to the repositories listed below.
- Website with current metrics on the fastest AI models. ☆43 · Nov 13, 2024 · Updated last year
- Python client SDK for Ultravox. ☆16 · Dec 10, 2025 · Updated 2 months ago
- 🍳 View the repository like github ☆11 · Dec 4, 2019 · Updated 6 years ago
- A proof of concept attempt ☆13 · Apr 18, 2021 · Updated 4 years ago
- ☆21 · Jan 27, 2026 · Updated 2 weeks ago
- Get best practice babel config of Ant financial. ☆16 · May 7, 2020 · Updated 5 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues" ☆15 · Jul 22, 2025 · Updated 6 months ago
- Datasets for hackernews posts ☆16 · Feb 17, 2022 · Updated 3 years ago
- A trivial wrapper around spf13/cobra to simplify some basic patterns ☆21 · Oct 23, 2023 · Updated 2 years ago
- Jest preprocessor/transformer for Rust ☆17 · Dec 7, 2022 · Updated 3 years ago
- Ant Design Changelog Editor ☆17 · Aug 11, 2024 · Updated last year
- umi 2.x plugin that automatically uploads the build output to a remote server once the build finishes ☆15 · Jun 9, 2020 · Updated 5 years ago
- umi plugin for integrating macaca-datahub, which is a GUI-style mock tool that can be used to replace umi's built-in mock solution. ☆22 · Mar 27, 2022 · Updated 3 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval ☆19 · Sep 24, 2022 · Updated 3 years ago
- Easy package.json exports. ☆30 · Apr 19, 2023 · Updated 2 years ago
- A datazoom slider plugin for G2. ☆20 · Aug 9, 2019 · Updated 6 years ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models ☆54 · May 3, 2025 · Updated 9 months ago
- ☆476 · Jan 10, 2024 · Updated 2 years ago
- (Deprecated) The tools used to build umi. ☆25 · Sep 7, 2020 · Updated 5 years ago
- Select a one-, two-dimensional or irregular region using the mouse. ☆24 · Dec 15, 2017 · Updated 8 years ago
- ☆33 · Feb 2, 2026 · Updated last week
- The repository contains code for Adaptive Data Optimization ☆32 · Dec 9, 2024 · Updated last year
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main) ☆114 · Mar 20, 2025 · Updated 10 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆35 · Apr 17, 2025 · Updated 9 months ago
- ☆30 · Sep 5, 2021 · Updated 4 years ago
- Resources for the shared task on conversational question answering SCAI-QReCC 2021 ☆29 · Jun 18, 2022 · Updated 3 years ago
- Flow-based data pre-processing for deep learning ☆31 · Jan 6, 2021 · Updated 5 years ago
- Serlina binding for Egg.js ☆32 · Nov 8, 2018 · Updated 7 years ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances ☆135 · Feb 22, 2024 · Updated last year
- A collection of reproducible inference engine benchmarks ☆38 · Apr 22, 2025 · Updated 9 months ago
- LLMPerf is a library for validating and benchmarking LLMs ☆1,084 · Dec 9, 2024 · Updated last year
- vLLM performance dashboard ☆41 · Apr 26, 2024 · Updated last year
- 😍 use mobx-state-tree gracefully in umijs. ☆34 · Aug 22, 2018 · Updated 7 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks ☆21 · Jan 19, 2026 · Updated 3 weeks ago
- A benchmark for emotional intelligence in large language models ☆400 · Jul 26, 2024 · Updated last year
- A simple unified framework for evaluating LLMs ☆261 · Apr 14, 2025 · Updated 9 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading ☆39 · Jan 4, 2024 · Updated 2 years ago
- ABench is an evolving open-source benchmark suite designed to rigorously evaluate and enhance Large Language Models (LLMs) on complex cro… ☆24 · Sep 29, 2025 · Updated 4 months ago
- ☆22 · Dec 11, 2025 · Updated 2 months ago