LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。
☆217Dec 10, 2025Updated 2 months ago
Alternatives and similar repositories for llm-benchmark
Users that are interested in llm-benchmark are comparing it to the libraries listed below
Sorting:
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆23Feb 4, 2025Updated last year
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- 基于Qwen2模型进行通用信息抽取【实体/关系/事件抽取】☆42Jul 10, 2024Updated last year
- ☆20Dec 29, 2023Updated 2 years ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆42Oct 28, 2024Updated last year
- ☆14Feb 9, 2026Updated 3 weeks ago
- Inference deployment of the llama3☆11Apr 21, 2024Updated last year
- YOLOv12 TensorRT 端到端模型加速 推理和INT8量化实现☆13Mar 5, 2025Updated last year
- 这是一个使用opencv读取视频并使用socket进行传输视频画面的脚本文件,相较于调用ffmpeg传输节约了90%的数据量☆11May 14, 2024Updated last year
- opencv调用jetson/rk3588 mpp硬解码,重写了open与read函数,支持h264/h265☆14Nov 27, 2025Updated 3 months ago
- ☆15Dec 10, 2025Updated 2 months ago
- LLM智能路由网关、 Enterprise Intelligent AI-API Distribution Gateway☆13Jan 24, 2025Updated last year
- 大模型推理压测☆46Jul 31, 2025Updated 7 months ago
- a simple lightweight large language model pipeline framework.☆28Apr 25, 2025Updated 10 months ago
- Community maintained hardware plugin for vLLM on Ascend☆1,711Feb 28, 2026Updated last week
- 使用mnn-llm对GOT-OCR2.0进行推理☆14Oct 2, 2024Updated last year
- SuperPoint and LightGlue with TensorRT. Deploy with C++.☆22Dec 14, 2023Updated 2 years ago
- learn TensorRT from scratch🥰☆18Sep 29, 2024Updated last year
- Co-DETR (Detection Transformer) compiled from PyTorch to NVIDIA TensorRT☆20Apr 19, 2025Updated 10 months ago
- An open-source toolkit helping developers build natural language database query solutions☆27May 5, 2025Updated 10 months ago
- 一个简单的音频降噪工具,提高web UI界面和api接口☆44Nov 21, 2024Updated last year
- 阿里云天池 - GLM 法律行业大模型挑战赛 - 我们小组实现基于大模型的对话机器人源码☆17Oct 23, 2024Updated last year
- ☆11Feb 25, 2026Updated last week
- 一个轻量化的大模型推理框架☆21May 26, 2025Updated 9 months ago
- Vue Arco Project是一个基于Vue 3、Vite和Arco Design的现代化前端应用框架,提供了丰富的UI组件和便捷的开发体验,帮助开发者快速构建高品质的Web应用。☆20Apr 18, 2025Updated 10 months ago
- 学习vLLM,使用vLLM部署Qwen2-0.5B的模型,并使用docker部署。☆19Jun 22, 2024Updated last year
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- Dify-Plus 是 Dify 的企业级增强版,集成了基于 gin-vue-admin 的管理中心,并针对企业场景进行了功能优化。 🚀 Dify-Plus = 管理中心 + Dify 二开 。 特别说明: 本项目为开源社区的二次开发成果,严格遵循 Dify 原项目的版…☆2,050Feb 27, 2026Updated last week
- 高性能 高精度 大陆车牌、港澳车牌、台湾车牌 韩国车牌(South Korea LPR)识别 代码开源(ncnn移植)☆41Nov 5, 2025Updated 4 months ago
- A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.☆2,436Feb 25, 2026Updated last week
- Awesome code, projects, books, etc. related to CUDA☆31Feb 3, 2026Updated last month
- Real time faster whisper gradio☆25Aug 17, 2025Updated 6 months ago
- Ranking and Covariances for Practical Learned Keypoints☆75Updated this week
- Protective hooks for Claude Code that prevent accidental code loss through branch protection, automatic checkpointing, and safe commit …☆48Sep 15, 2025Updated 5 months ago
- ☆26Nov 21, 2024Updated last year
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆56Updated this week
- 01. Enabling various applications to be AI-enabled or used by AI.☆27Apr 25, 2024Updated last year
- ☆26Aug 15, 2023Updated 2 years ago
- A simple neural network inference framework☆25Aug 1, 2023Updated 2 years ago