Yoosu-L / llmapibenchmark
The LLM API Benchmark Tool is a flexible Go-based utility designed to measure and analyze the performance of OpenAI-compatible API endpoints across different concurrency levels.
☆63Updated 2 months ago
Alternatives and similar repositories for llmapibenchmark
Users who are interested in llmapibenchmark are comparing it to the libraries listed below
- LM inference server implementation based on *.cpp.☆294Updated last month
- Run DeepSeek-R1 GGUFs on KTransformers☆259Updated 10 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆232Updated 2 weeks ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆190Updated 3 weeks ago
- Model Context Protocol Servers for Milvus☆212Updated 3 weeks ago
- Open Source Text Embedding Models with OpenAI Compatible API☆165Updated last year
- LLMPerf is a library for validating and benchmarking LLMs☆1,080Updated last year
- Library for model distillation☆160Updated 4 months ago
- 🎉 An awesome & curated list of best LLMOps tools.☆184Updated last week
- LLM Inference benchmark☆432Updated last year
- ☆94Updated 6 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆810Updated this week
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆78Updated last year
- 📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simp…☆276Updated 6 months ago
- Clone of https://r.jina.ai which is deployable locally☆50Updated last year
- The main repository for building Pascal-compatible versions of ML applications and libraries.☆162Updated 4 months ago
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆132Updated 5 months ago
- An API emulator for LightRAG that lets you use LightRAG in Openwebui through its built-in Ollama interface; prefixes in the conversation can also switch LightRAG's mode.☆28Updated last year
- Convert different model APIs into the OpenAI API format out of the box.☆160Updated last year
- A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes…☆331Updated this week
- E2M API, converting everything to markdown (LLM-friendly Format).☆138Updated last year
- Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower exe…☆266Updated 8 months ago
- Verify the precision of all Kimi K2 API vendors☆494Updated 2 weeks ago
- ☆26Updated 2 weeks ago
- A Hugging Face mirror site.☆326Updated last year
- Handy tool to measure the performance and efficiency of LLMs workloads.☆74Updated 8 months ago
- gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image editing, and text-to-video models.☆243Updated last week
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆516Updated last year
- ☆180Updated last year
- ☆92Updated last year