Yoosu-L / llmapibenchmarkLinks
The LLM API Benchmark Tool is a flexible Go-based utility designed to measure and analyze the performance of OpenAI-compatible API endpoints across different concurrency levels.
☆39Updated last week
Alternatives and similar repositories for llmapibenchmark
Users that are interested in llmapibenchmark are comparing it to the libraries listed below
Sorting:
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆205Updated last month
- LM inference server implementation based on *.cpp.☆273Updated last month
- run DeepSeek-R1 GGUFs on KTransformers☆251Updated 6 months ago
- Convert different model APIs into the OpenAI API format out of the box.☆159Updated last year
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆161Updated 2 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆160Updated last year
- Self-hosted huggingface mirror service. 自建huggingface镜像服务。☆194Updated 2 months ago
- 通过该项目将Dify通过Pipeline接入OpenwebUI,可以兼并OpenwebUI的前端优势和相应生态以及Dify强大的模型可拓展性和Workflow的效益。☆38Updated 9 months ago
- ☆104Updated this week
- ☆92Updated 2 months ago
- 📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simp…☆249Updated 2 months ago
- ☆103Updated 2 months ago
- SDK for Dify plugins☆90Updated this week
- Library for model distillation☆151Updated 2 weeks ago
- 基于 Dify + Langfuse 的自动化评估服务☆82Updated 3 months ago
- Using GPT to parse PDF☆101Updated last year
- Handy tool to measure the performance and efficiency of LLMs workloads.☆71Updated 4 months ago
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆261Updated 5 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆72Updated last year
- ☆47Updated this week
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆27Updated 3 months ago
- The main repository for building Pascal-compatible versions of ML applications and libraries.☆128Updated last month
- MCP Server for SearXNG☆213Updated last week
- Model Context Protocol Servers for Milvus☆177Updated 3 months ago
- Receipts for creating AI Applications with APIs from DashScope (and friends)!☆62Updated 11 months ago
- LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。☆321Updated 3 months ago
- Turn a web server into an MCP server in one click without making any code changes.☆124Updated 2 weeks ago
- PSDify: A PowerShell module for workspace management for Dify, featuring various cmdlets for managing Apps, Knowledges, Models, and Membe…☆21Updated 2 weeks ago
- Clone of https://r.jina.ai which is deployable locally☆48Updated last year
- dify-connector is a tool to publish Dify apps to various IM platforms. | dify-connector 是一个将 Dify 发布到各种 IM 平台的工具。☆99Updated last year