pandada8 / llm-inference-benchmark
LLM inference service performance testing
☆44Updated last year
Alternatives and similar repositories for llm-inference-benchmark
Users that are interested in llm-inference-benchmark are comparing it to the libraries listed below
- LLM101n: Let's build a Storyteller (Chinese edition)☆132Updated last year
- ☆174Updated last week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆264Updated last month
- vLLM documentation in Simplified Chinese☆102Updated 2 weeks ago
- ☆60Updated last week
- Tongyi Qianwen (Qwen) vLLM inference deployment demo☆603Updated last year
- LLM Inference benchmark☆426Updated last year
- ☆114Updated 10 months ago
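- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to understanding, discussing, and implementing the technologies, principles, and applications related to large models.☆339Updated last year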
- Accelerate inference without tears☆323Updated 6 months ago
- ☆135Updated 7 months ago
- Train a LLaVA model with better Chinese support, and open-source the training code and data.☆70Updated last year
- A flexible and efficient training framework for large-scale alignment tasks☆422Updated last week
- FlagScale is a large model toolkit based on open-sourced projects.☆353Updated last week
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆290Updated 2 months ago
- InternEvo is an open-source lightweight training framework that aims to support model pre-training without the need for extensive dependencie…☆407Updated 3 weeks ago
- Train a 1B LLM with 1T tokens from scratch as an individual☆732Updated 4 months ago
- ☆353Updated last year
- ☆231Updated last year
- ☆353Updated this week
- Inference code for LLaMA models☆123Updated 2 years ago
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆471Updated 4 months ago
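- An ecosystem of large language models and multimodal models, mainly covering cross-modal search, speculative decoding, QAT quantization, multimodal quantization, ChatBot, and OCR☆189Updated last month
- Chinese LLM fine-tuning (LLM-SFT), math instruction dataset MWP-Instruct, supported models (ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), supports (LoRA, QLoRA, DeepSpeed, UI, TensorboardX), supports (微…☆211Updated last year
- Starting from scratch, run through the full ChatGPT technical pipeline.☆257Updated last year
- unify-easy-llm (ULM) aims to provide a simple one-click training tool for large models, supporting different hardware such as Nvidia GPU and Ascend NPU, as well as commonly used large models.☆57Updated last year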
- TinyRAG☆336Updated 2 months ago
- ☆497Updated last week
- ☆50Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆137Updated 9 months ago