Yoosu-L / llmapibenchmarkLinks
The LLM API Benchmark Tool is a flexible Go-based utility designed to measure and analyze the performance of OpenAI-compatible API endpoints across different concurrency levels.
☆27Updated 4 months ago
Alternatives and similar repositories for llmapibenchmark
Users that are interested in llmapibenchmark are comparing it to the libraries listed below
Sorting:
- LM inference server implementation based on *.cpp.☆226Updated this week
- Convert different model APIs into the OpenAI API format out of the box.☆153Updated last year
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆124Updated 2 weeks ago
- Library for model distillation☆144Updated 4 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆79Updated 5 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆153Updated 11 months ago
- LLM Inference benchmark☆421Updated 11 months ago
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。☆194Updated this week
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆152Updated 8 months ago
- Using GPT to parse PDF☆98Updated 9 months ago
- Awesome Code Action - DeepWebSearch AgentKit App. Build with 🤗 Hugging Face smolagents framework☆72Updated last week
- 基于 Dify + Langfuse 的自动化评估服务☆68Updated 3 weeks ago
- bisheng-unstructured library☆51Updated last month
- A Next.js version of Claude Aritfacts , inspired by llamacoder☆23Updated 9 months ago
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆108Updated 2 months ago
- An OpenAI API-compatible middleware for Qwen OpenAI API, implementing (stream) tool calling functionality☆9Updated 10 months ago
- ☆90Updated 3 months ago
- Qwen GRPO Graph Extraction RL Finetune☆49Updated 2 months ago
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。☆175Updated this week
- 通过该项目将Dify通过Pipeline接入OpenwebUI,可以兼并OpenwebUI的前端优势和相应生态以及Dify强大的模型可拓展性和Workflow的效益。☆32Updated 6 months ago
- A fluent, scalable, and easy-to-use LLM data processing framework.☆21Updated 2 weeks ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆65Updated last month
- Clone of https://r.jina.ai which is deployable locally☆44Updated 9 months ago
- AutoHub: A Personal Browser Automation Assistant☆22Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆131Updated last year
- run DeepSeek-R1 GGUFs on KTransformers☆236Updated 3 months ago
- 与 https://github.com/tonori/mem0ai-api 配合使用的非官方的 mem0ai provider.☆48Updated 11 months ago
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆52Updated this week
- ☆39Updated this week
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆82Updated last year