Yoosu-L / llmapibenchmark

The LLM API Benchmark Tool is a flexible Go-based utility designed to measure and analyze the performance of OpenAI-compatible API endpoints across different concurrency levels.

☆31

Alternatives and similar repositories for llmapibenchmark

Users who are interested in llmapibenchmark are comparing it to the repositories listed below.

gpustack / llama-box
LM inference server implementation based on *.cpp.
☆236 Updated this week
gpustack / vox-box
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
☆134 Updated last week
hargup / reader
A locally deployable clone of https://r.jina.ai
☆45 Updated 10 months ago
intergalacticalvariable / reader
📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simp…
☆232 Updated 9 months ago
limcheekin / open-text-embeddings
Open Source Text Embedding Models with OpenAI Compatible API
☆155 Updated last year
horus-ai-labs / DistillFlow
Library for model distillation
☆146 Updated 5 months ago
gpustack / gguf-parser-go
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
☆188 Updated this week
muyuworks / myla
A local implementation of the OpenAI Assistants API; myla stands for MY Local Assistant.
☆55 Updated 10 months ago
wade1010 / graphrag-ui
Uses the latest GraphRAG interface, with a local Ollama instance providing the LLM interface. Supports installation via pip.
☆154 Updated 9 months ago
ubergarm / r1-ktransformers-guide
run DeepSeek-R1 GGUFs on KTransformers
☆242 Updated 4 months ago
JiangYain / Dify_Pipeline_OpenwebUI
This project connects Dify to OpenwebUI through a Pipeline, combining OpenwebUI's front-end strengths and ecosystem with Dify's strong model extensibility and Workflow benefits.
☆35 Updated 7 months ago
hustyichi / dify-eval
An automated evaluation service built on Dify and Langfuse.
☆70 Updated last month
Jing-yilin / E2M
E2M API, converting everything to markdown (LLM-friendly Format).
☆135 Updated 7 months ago
daodao97 / gptpdf-ui
Using GPT to parse PDF
☆99 Updated 10 months ago
soulteary / amazing-openai-api
Convert different model APIs into the OpenAI API format out of the box.
☆156 Updated last year
win4r / o1
Using Groq or OpenAI or Ollama to create o1-like reasoning chains
☆297 Updated 10 months ago
hwchase17 / auto-openai-prompter
☆249 Updated last year
InternLM / OpenAOE
LLM Group Chat Framework: chat with multiple LLMs at the same time.
☆307 Updated 3 weeks ago
rainchen / dify-tool-LongTermMemory
A Dify tool for storing and retrieving long-term memory, using Dify's built-in Knowledge dataset to store memories; each user has a stan…
☆83 Updated 11 months ago
xorbitsai / xllamacpp
xllamacpp - a Python wrapper of llama.cpp
☆45 Updated last week
GenseeAI / cognify
Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower exe…
☆243 Updated 2 months ago
KylinMountain / graphrag-server
Adds a 🚀 streaming web service to GraphRAG, compatible with the OpenAI SDK, with accessible entity links 🔗, suggested questions, support for local embedding models, and many bug fixes.
☆255 Updated 3 months ago
tensorchord / modelz-llm
OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)
☆275 Updated last year
essamamdani / search-result-scraper-markdown
This project provides a powerful web scraping tool that fetches search results and converts them into Markdown format using FastAPI, Sear…
☆224 Updated 6 months ago
ParisNeo / ollama_proxy_server
A proxy server for multiple Ollama instances with key-based security
☆462 Updated last week
sugarforever / peanut-shell
☆53 Updated 7 months ago
dashscope / dash-cookbook
Recipes for creating AI applications with APIs from DashScope (and friends)!
☆58 Updated 9 months ago
eosphoros-ai / DB-GPT-Plugins
Multi-agent and plugin repository for DB-GPT; it can complete various database-related tasks.
☆103 Updated last year
thad0ctor / llama-server-launcher
☆101 Updated this week
NexaAI / octopus-v4
AI for all: Build the large graph of the language models
☆270 Updated last year