pandada8 / llm-inference-benchmark
LLM inference service performance testing
☆44Updated last year
Alternatives and similar repositories for llm-inference-benchmark
Users that are interested in llm-inference-benchmark are comparing it to the libraries listed below
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆268Updated 3 months ago
- LLM101n: Let's build a Storyteller (Chinese edition)☆135Updated last year
- ☆177Updated this week
- vLLM Documentation in Simplified Chinese☆124Updated last month
- ☆65Updated last week
- LLM Inference benchmark☆430Updated last year
- FlagScale is a large model toolkit based on open-sourced projects.☆407Updated last week
- Accelerate inference without tears☆367Updated last month
- Tongyi Qianwen vLLM inference and deployment demo☆620Updated last year
- A flexible and efficient training framework for large-scale alignment tasks☆437Updated 3 weeks ago
- ☆512Updated 2 months ago
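- Train a LLaVA model with better Chinese support, and open-source the training code and data.☆76Updated last year
- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to understanding, discussing, and implementing the techniques, principles, and applications related to large models.☆352Updated last year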
- InternEvo is an open-sourced lightweight training framework that aims to support model pre-training without the need for extensive dependencie…☆411Updated 3 months ago
- NLP project records archive☆61Updated 7 months ago
- Inference code for LLaMA models☆127Updated 2 years ago
- ☆115Updated last year
- ☆235Updated last year
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆249Updated last year
- 青稞Talk☆161Updated last week
- Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.☆201Updated last week
- LLaMA Factory Document☆154Updated 2 weeks ago
- FlagEval is an evaluation toolkit for AI large foundation models.☆339Updated 6 months ago
- ☆368Updated this week
- Mixture-of-Experts (MoE) Language Model☆192Updated last year
- Starting from scratch, run through the ChatGPT technical pipeline end to end.☆267Updated last year
- Train a 1B LLM on 1T tokens from scratch as an individual☆753Updated 6 months ago
- Chinese edition of llm-numbers☆126Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆138Updated 11 months ago
- This is a repository used by individuals to experiment with and reproduce the LLM pre-training process.☆476Updated 6 months ago