Yoosu-L / llmapibenchmarkLinks
The LLM API Benchmark Tool is a flexible Go-based utility designed to measure and analyze the performance of OpenAI-compatible API endpoints across different concurrency levels.
☆49Updated 3 weeks ago
Alternatives and similar repositories for llmapibenchmark
Users that are interested in llmapibenchmark are comparing it to the libraries listed below
Sorting:
- LM inference server implementation based on *.cpp.☆286Updated 2 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆212Updated 2 months ago
- run DeepSeek-R1 GGUFs on KTransformers☆254Updated 8 months ago
- Self-hosted huggingface mirror service. 自建huggingface镜像服务。☆200Updated 3 months ago
- Clone of https://r.jina.ai which is deployable locally☆48Updated last year
- Convert different model APIs into the OpenAI API format out of the box.☆160Updated last year
- 通过该项目将Dify通过Pipeline接入OpenwebUI,可以兼并OpenwebUI的前端优势和相应生态以及Dify强大的模型可拓展性和Workflow的效益。☆38Updated 11 months ago
- 📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simp…☆255Updated 3 months ago
- Library for model distillation☆153Updated last month
- 基于 Dify + Langfuse 的自动化评估服务☆84Updated 5 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆169Updated 3 months ago
- LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。☆319Updated 4 months ago
- a local implementation of OpenAI Assistants API: myla stands for MY Local Assistant☆57Updated last year
- A open version Manus.☆67Updated 7 months ago
- Using GPT to parse PDF☆101Updated last year
- The main repository for building Pascal-compatible versions of ML applications and libraries.☆138Updated 2 months ago
- Data browser based on s3. 一个基于 S3 的数据(json / jsonl / html / md等)可视化工具。👇 Try online.☆77Updated last week
- BGE-large Embeddings api by FastAPI☆43Updated last year
- AI Proxy is a high-performance AI gateway using OpenAI's and Claude protocol as the entry point. It features intelligent error handling, …☆219Updated last week
- MinerU API server☆78Updated 10 months ago
- Open-source observability for your LLM application.☆52Updated 10 months ago
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆128Updated 7 months ago
- Intelligent data apps and assets with LLMs☆167Updated 7 months ago
- ☆23Updated last year
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆216Updated this week
- PSDify: A PowerShell module for workspace management for Dify, featuring various cmdlets for managing Apps, Knowledges, Models, and Membe…☆21Updated this week
- LLM Inference benchmark☆428Updated last year
- A powerful tool for creating high-quality training datasets for Large Language Models (LLMs)(一个快速生成高质量LLM微调训练数据集的工具)☆131Updated 2 months ago
- Model Context Protocol Servers for Milvus☆190Updated last week
- Easy, fast, and cheap pretrain,finetune, serving for everyone☆315Updated 3 months ago