Yoosu-L / llmapibenchmarkLinks
The LLM API Benchmark Tool is a flexible Go-based utility designed to measure and analyze the performance of OpenAI-compatible API endpoints across different concurrency levels.
☆62Updated last month
Alternatives and similar repositories for llmapibenchmark
Users that are interested in llmapibenchmark are comparing it to the libraries listed below
Sorting:
- LM inference server implementation based on *.cpp.☆294Updated last month
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆221Updated 4 months ago
- run DeepSeek-R1 GGUFs on KTransformers☆258Updated 9 months ago
- Convert different model APIs into the OpenAI API format out of the box.☆160Updated last year
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆242Updated last week
- LLM Inference benchmark☆430Updated last year
- LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。☆321Updated 6 months ago
- Self-hosted huggingface mirror service. 自建huggingface镜像服务。☆209Updated 5 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆184Updated 3 weeks ago
- 📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simp…☆268Updated 5 months ago
- LLMPerf is a library for validating and benchmarking LLMs☆1,068Updated last year
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,357Updated this week
- Library for model distillation☆158Updated 3 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆164Updated last year
- The main repository for building Pascal-compatible versions of ML applications and libraries.☆156Updated 4 months ago
- E2M API, converting everything to markdown (LLM-friendly Format).☆138Updated last year
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆131Updated 8 months ago
- Agents of C.L.I.☆144Updated 3 months ago
- ☆133Updated 8 months ago
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆262Updated 8 months ago
- 基于 Dify + Langfuse 的自动化评估服务☆85Updated 6 months ago
- Clone of https://r.jina.ai which is deployable locally☆49Updated last year
- 通过该项目将Dify通过Pipeline接入OpenwebUI,可以兼并OpenwebUI的前端优势和相应生态以及Dify强大的模型可拓展性和Workflow的效益。☆38Updated last year
- Easy, fast, and cheap pretrain,finetune, serving for everyone☆316Updated 5 months ago
- ☆383Updated this week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆270Updated 4 months ago
- The driver for LMCache core to run in vLLM☆59Updated 10 months ago
- Comparison of Language Model Inference Engines☆238Updated last year
- ☆108Updated 2 weeks ago
- Chat2Graph: Graph Native Agentic System.☆374Updated last month