Yoosu-L / llmapibenchmarkLinks
The LLM API Benchmark Tool is a flexible Go-based utility designed to measure and analyze the performance of OpenAI-compatible API endpoints across different concurrency levels.
☆31Updated 4 months ago
Alternatives and similar repositories for llmapibenchmark
Users that are interested in llmapibenchmark are comparing it to the libraries listed below
Sorting:
- LM inference server implementation based on *.cpp.☆236Updated this week
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆134Updated last week
- Clone of https://r.jina.ai which is deployable locally☆45Updated 10 months ago
- 📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simp…☆232Updated 9 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆155Updated last year
- Library for model distillation☆146Updated 5 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆188Updated this week
- a local implementation of OpenAI Assistants API: myla stands for MY Local Assistant☆55Updated 10 months ago
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆154Updated 9 months ago
- run DeepSeek-R1 GGUFs on KTransformers☆242Updated 4 months ago
- 通过该项目将Dify通过Pipeline接入OpenwebUI,可以兼并OpenwebUI的前端优势和相应生态以及Dify强大的模型可拓展性和Workflow的效益。☆35Updated 7 months ago
- 基于 Dify + Langfuse 的自动化评估服务☆70Updated last month
- E2M API, converting everything to markdown (LLM-friendly Format).☆135Updated 7 months ago
- Using GPT to parse PDF☆99Updated 10 months ago
- Convert different model APIs into the OpenAI API format out of the box.☆156Updated last year
- Using Groq or OpenAI or Ollama to create o1-like reasoning chains☆297Updated 10 months ago
- ☆249Updated last year
- LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。☆307Updated 3 weeks ago
- a Dify tool for storing and retrieving long-term-memory, using Dify built-in Knowledge dataset for storing memories, each user has a stan…☆83Updated 11 months ago
- xllamacpp - a Python wrapper of llama.cpp☆45Updated last week
- Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower exe…☆243Updated 2 months ago
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆255Updated 3 months ago
- OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)☆275Updated last year
- This project provides a powerful web scraping tool that fetches search results and converts them into Markdown format using FastAPI, Sear…☆224Updated 6 months ago
- A proxy server for multiple ollama instances with Key security☆462Updated last week
- ☆53Updated 7 months ago
- Receipts for creating AI Applications with APIs from DashScope (and friends)!☆58Updated 9 months ago
- Multi-Agents & Plugins repo for DB-GPT, Can complete various tasks around databases.☆103Updated last year
- ☆101Updated this week
- AI for all: Build the large graph of the language models☆270Updated last year