anarchy-ai / llm-speed-benchmark
Benchmarking tool for assessing LLM performance across different hardware
☆13 · Updated 11 months ago
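To ground what a speed benchmark like this measures, here is a minimal sketch of timing tokens per second against an OpenAI-compatible completion endpoint. The endpoint URL, model name, and request shape are illustrative assumptions, not llm-speed-benchmark's actual configuration or code.

```python
# Minimal throughput-measurement sketch, in the spirit of llm-speed-benchmark.
# ENDPOINT and MODEL are hypothetical placeholders, not the tool's defaults.
import time
import requests

ENDPOINT = "http://localhost:8000/v1/chat/completions"  # assumed local server
MODEL = "my-local-model"                                # assumed model name

def measure_tokens_per_second(prompt: str) -> float:
    """Time one completion and derive tokens/sec from the reported usage."""
    start = time.perf_counter()
    resp = requests.post(
        ENDPOINT,
        json={
            "model": MODEL,
            "messages": [{"role": "user", "content": prompt}],
            "max_tokens": 256,
        },
        timeout=120,
    )
    resp.raise_for_status()
    elapsed = time.perf_counter() - start
    # OpenAI-compatible servers report generated-token counts under "usage".
    completion_tokens = resp.json()["usage"]["completion_tokens"]
    return completion_tokens / elapsed

if __name__ == "__main__":
    tps = measure_tokens_per_second("Explain KV caching in one paragraph.")
    print(f"{tps:.1f} tok/s")
```

Running the same script on different machines against the same model gives a crude hardware comparison; a real benchmark would also warm up the server and average over multiple requests.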
Related projects
Alternatives and complementary repositories for llm-speed-benchmark
- Runner in charge of collecting metrics from LLM inference endpoints for the Unify Hub ☆17 · Updated 9 months ago
- LLM code editor for backend services ☆11 · Updated last month
- The backend behind the LLM-Perf Leaderboard ☆10 · Updated 6 months ago
- Public reports detailing responses to sets of prompts by Large Language Models. ☆26 · Updated last year
- GPU Environment Management for Visual Studio Code ☆35 · Updated last year
- Collection of recipes aiding Gen AI model development ☆88 · Updated last week
- Visualize expert firing frequencies across sentences in the Mixtral MoE model ☆17 · Updated 10 months ago
- Benchmark suite for LLMs from Fireworks.ai ☆58 · Updated 2 weeks ago
- LLM plugin for models hosted by OpenRouter ☆68 · Updated 6 months ago
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ ☆63 · Updated last year
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia ☆31 · Updated 5 months ago
- A pipeline for using API calls to agnostically convert unstructured data into structured training data ☆28 · Updated 2 months ago
- Transformer GPU VRAM estimator (see the back-of-envelope sketch after this list) ☆40 · Updated 7 months ago
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code ☆44 · Updated last year
- Tutorial for building an LLM router ☆163 · Updated 4 months ago
- A landscape of the infrastructure that powers the generative AI ecosystem ☆130 · Updated last month
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state-of-the-art machine learning models access… ☆114 · Updated 9 months ago
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub ☆155 · Updated last year
- Generate glue code in seconds to simplify your NVIDIA Triton Inference Server deployments ☆15 · Updated 4 months ago
- ReLM is a Regular Expression engine for Language Models ☆104 · Updated last year
- Just a bunch of benchmark logs for different LLMs ☆115 · Updated 3 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving. ☆54 · Updated 7 months ago
- Horizon chart for CPU/GPU/Neural Engine utilization monitoring on Apple M1/M2 and NVIDIA GPUs on Linux ☆24 · Updated last month
- Google TPU optimizations for transformers models ☆75 · Updated this week
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆23 · Updated last week
- Drop-in replacement for OpenAI's embedding API. Self-hosted. ☆51 · Updated last year
- Self-host LLMs with vLLM and BentoML ☆74 · Updated last week
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆28 · Updated 9 months ago
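As a rough illustration of what a tool like the Transformer GPU VRAM estimator above computes, here is a back-of-envelope sketch. The formula (weights plus KV cache) is a standard approximation; the layer/head numbers in the example are assumptions, not that project's defaults or method.

```python
# Back-of-envelope VRAM estimate for transformer inference (illustrative only).
# Ignores activations, fragmentation, and framework overhead.

def estimate_inference_vram_gb(
    n_params: float,        # model size in parameters, e.g. 7e9
    bytes_per_param: int,   # 2 for fp16/bf16, 1 for int8
    n_layers: int,
    n_kv_heads: int,
    head_dim: int,
    seq_len: int,
    batch_size: int = 1,
) -> float:
    """Estimate VRAM in decimal GB as model weights plus KV cache."""
    weights = n_params * bytes_per_param
    # The KV cache stores one key and one value vector per layer per token.
    kv_cache = (
        2 * n_layers * n_kv_heads * head_dim
        * seq_len * batch_size * bytes_per_param
    )
    return (weights + kv_cache) / 1e9

# Example with assumed LLaMA-2-7B-like dimensions at fp16 and a 4k context:
# 14 GB of weights + ~2.1 GB of KV cache ≈ 16.1 GB.
print(f"{estimate_inference_vram_gb(7e9, 2, 32, 32, 128, 4096):.1f} GB")
```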