Yoosu-L / llmapibenchmarkLinks
The LLM API Benchmark Tool is a flexible Go-based utility designed to measure and analyze the performance of OpenAI-compatible API endpoints across different concurrency levels.
☆68Updated 3 months ago
Alternatives and similar repositories for llmapibenchmark
Users that are interested in llmapibenchmark are comparing it to the libraries listed below
Sorting:
- LM inference server implementation based on *.cpp.☆295Updated 2 months ago
- run DeepSeek-R1 GGUFs on KTransformers☆261Updated 11 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆238Updated last month
- LLMPerf is a library for validating and benchmarking LLMs☆1,081Updated last year
- Open Source Text Embedding Models with OpenAI Compatible API☆167Updated last year
- Library for model distillation☆161Updated 5 months ago
- Convert different model APIs into the OpenAI API format out of the box.☆160Updated last year
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆192Updated last month
- Clone of https://r.jina.ai which is deployable locally☆50Updated last year
- Self-hosted huggingface mirror service. 自建huggingface镜像服务。☆212Updated 6 months ago
- Model Context Protocol Servers for Milvus☆214Updated last month
- E2M API, converting everything to markdown (LLM-friendly Format).☆139Updated last year
- 📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simp…☆285Updated 6 months ago
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆244Updated last week
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆132Updated 10 months ago
- 基于 Dify + Langfuse 的自动化评估服务☆88Updated 8 months ago
- 通过该项目将Dify通过Pipeline接入OpenwebUI,可以兼并OpenwebUI的前端优势和相应生态以及Dify强大的模型可拓展性和Workflow的效益。☆39Updated last year
- Deploy Dify on Kubernetes☆347Updated 3 weeks ago
- ☆395Updated this week
- A collection of RAG systems powered by LLM.☆216Updated 11 months ago
- ☆94Updated 7 months ago
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆133Updated 5 months ago
- SDK for Dify plugins☆123Updated this week
- LLM Inference benchmark☆433Updated last year
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆78Updated last year
- Easy, fast, and cheap pretrain,finetune, serving for everyone☆315Updated 6 months ago
- The main repository for building Pascal-compatible versions of ML applications and libraries.☆169Updated 5 months ago
- A open version Manus.☆67Updated 10 months ago
- 一个LightRAG的API模拟器,用于在Openwebui中通过自带的Ollama接口使用LightRAG;通过对话时使用前缀,还可以实现lightrag的模式切换。☆30Updated last year
- Receipts for creating AI Applications with APIs from DashScope (and friends)!☆73Updated last year