coder543/llm-speed-benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/coder543/llm-speed-benchmark)

coder543 / llm-speed-benchmark

A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API

☆25

Alternatives and similar repositories for llm-speed-benchmark

Users that are interested in llm-speed-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Nero10578 / LLM-Inference-Benchmark
View on GitHub
☆14Aug 25, 2024Updated last year
charmandercha / ArchiDoc
View on GitHub
☆16Dec 16, 2024Updated last year
FarFetchd / sleepyllama
View on GitHub
an auto-sleeping and -waking framework around llama.cpp
☆13Feb 8, 2025Updated last year
kseyhan / llama-param-pal
View on GitHub
☆12May 30, 2025Updated last year
Nyralei / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆12Aug 1, 2025Updated 11 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
datobs / react-native-perspective-image-cropper
View on GitHub
Perform custom crop, resizing and perspective correction 📐🖼
☆11May 9, 2025Updated last year
sasha0552 / nvidia-pstate
View on GitHub
A library and CLI utilities for managing performance states of NVIDIA GPUs.
☆37Oct 6, 2024Updated last year
jxqu3 / aiui
View on GitHub
A simple no-install web UI for Ollama and OAI-Compatible APIs!
☆31Jan 30, 2025Updated last year
transferwise / wise-topic
View on GitHub
LLM-only topic extraction and classification
☆11Jun 3, 2026Updated last month
oteroantoniogom / Unsloth-VLLM-RTX5090-Ubuntu
View on GitHub
Automated bash script to set up a high-performance environment on Ubuntu Linux with RTX5090, including installations of PyTorch, Unsloth,…
☆18Apr 1, 2025Updated last year
keeeeenw / TinyLlama
View on GitHub
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆14Mar 30, 2024Updated 2 years ago
extrawest / ev_stations_map_showcase
View on GitHub
This project is an app that shows a map with Electric Charging Stations and their information. The app supports station markers clusterin…
☆12Jan 15, 2024Updated 2 years ago
Write-with-LAIKA / drama-engine
View on GitHub
A Framework for Narrative Agents
☆42Mar 2, 2026Updated 4 months ago
shivamsanju / ragswift
View on GitHub
🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform
☆38Jan 29, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
SeungyounShin / agent-verify
View on GitHub
A systematic empirical study of self-verification strategies in agentic coding harnesses
☆29Mar 4, 2026Updated 4 months ago
jryebread / LLMBenchMark
View on GitHub
Like system requirements lab but for LLMs
☆31Jun 10, 2023Updated 3 years ago
xTimeCrystal / MiniModel
View on GitHub
☆42Feb 25, 2026Updated 5 months ago
jacobmarks / huggingface-fiftyone-converters
View on GitHub
Convert datasets from Hugging Face to FiftyOne for Visualization
☆11Mar 15, 2024Updated 2 years ago
koji / pnpm-package-template-with-tsup
View on GitHub
npm package template with typescript and tsup
☆11Nov 27, 2025Updated 7 months ago
surendhar153 / one-tap-google-sign-in
View on GitHub
Allows users to add Google One Tap Sign-in or Sign-up to wordpress website.
☆17Jun 7, 2026Updated last month
Nondzu / LlamaTor
View on GitHub
LlamaTor: Decentralized AI model sharing via BitTorrent for efficient, user-friendly distribution and collaboration.
☆65Jan 5, 2025Updated last year
alicfeng / kubernetes_cicd
View on GitHub
For the better CI as well as CD using gogs and drone base on kubernetes
☆10Jul 31, 2021Updated 4 years ago
TengHu / AutoCoder
View on GitHub
☆11Jan 28, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
bsamud / openfoundry-agentic-framework
View on GitHub
Multi-agent orchestration framework for AI applications - build, deploy, and manage AI agents across the full lifecycle with Forge, Conve…
☆33Mar 28, 2026Updated 3 months ago
Pxplore / pxplore-algo
View on GitHub
A modular system for personalized learning exploration.
☆15Oct 10, 2025Updated 9 months ago
MatN23 / AdaptiveTrainingSystem
View on GitHub
A PyTorch framework for training transformer language models with Mixture of Experts (MoE) architecture support, Mixture of Depths (MoD),…
☆21Updated this week
and270 / thinking_effort_processor
View on GitHub
☆93Jul 7, 2025Updated last year
TomokiMiyauci / mapcss
View on GitHub
Tiny, composable Atomic CSS engine
☆13Apr 1, 2022Updated 4 years ago
matatonic / openedai-images
View on GitHub
An OpenAI API compatible images server to generate or manipulate images.
☆18Feb 2, 2025Updated last year
ryanermita / apache-logs-analyzer
View on GitHub
A simple apache logs analyzer.
☆18Oct 1, 2017Updated 8 years ago
bashalarmist / hello-ooba
View on GitHub
Oobabooga "Hello World" API example for node.js with Express
☆13Jul 2, 2023Updated 3 years ago
Lanerra / saga
View on GitHub
Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.
☆108Feb 16, 2026Updated 5 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
uclnlp / APE
View on GitHub
Adaptive Passage Encoder for Open-domain Question Answering
☆15Jun 1, 2021Updated 5 years ago
edenhaus / bumper
View on GitHub
A standalone and self-hosted implementation of the central server used by Ecovacs vacuum robots.
☆19Sep 9, 2025Updated 10 months ago
TECHS-Technological-Solutions / ocpp-simulator
View on GitHub
☆17Aug 29, 2022Updated 3 years ago
Miserlou / SynthRecipies
View on GitHub
Random Serum Patches
☆17Apr 21, 2018Updated 8 years ago
sammcj / moa
View on GitHub
Mixture-of-Ollamas
☆31Aug 12, 2024Updated last year
j-min / WikiExtractor_To_the_one_text
View on GitHub
Simple extension of WikiExtractor(https://github.com/attardi/wikiextractor)
☆16Dec 23, 2016Updated 9 years ago
hallogameboy / QDS-Transformer
View on GitHub
☆16Sep 28, 2020Updated 5 years ago