morsoli / llmbenchmark
Comparing performance metrics of large-model APIs: an in-depth analysis of key metrics such as TTFT and TPS
☆19 Updated 11 months ago
Alternatives and similar repositories for llmbenchmark
Users who are interested in llmbenchmark are comparing it to the repositories listed below.
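- Accelerated deployment of BERT models with TensorRT ☆10 Updated 3 years ago
- Tianchi NVIDIA TensorRT Hackathon 2023, Generative AI Model Optimization Contest: third-place solution in the preliminary round ☆50 Updated 2 years ago
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing Tongyi Qianwen Qwen-7B with TensorRT-LLM ☆42 Updated last year
- Deploying the 1.5B model distilled from DeepSeek-R1 on Android phones ☆22 Updated 6 months ago
- Deploying Qwen2.5 with FastAPI + vLLM ☆22 Updated 11 months ago
- YOLOv12 TensorRT end-to-end accelerated inference and INT8 quantization implementation ☆13 Updated 5 months ago
- Deploy ONNX models with TensorRT and LibTorch ☆17 Updated 3 years ago
- Inference for GOT-OCR2.0 using mnn-llm ☆15 Updated 10 months ago
- A roundup of data competition news from China and abroad ☆18 Updated 3 years ago
- ☆27 Updated 2 months ago
- HunyuanDiT with TensorRT and libtorch ☆17 Updated last year
- Whisper in TensorRT-LLM ☆16 Updated last year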
- run ChatGLM2-6B in BM1684X