pandada8 / llm-inference-benchmark

LLM inference service performance testing

☆44

Alternatives and similar repositories for llm-inference-benchmark

Users who are interested in llm-inference-benchmark are comparing it to the repositories listed below.

modelscope / dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …
☆268 Updated 3 months ago
SmartFlowAI / LLM101n-CN
LLM101n: Let's build a Storyteller (Chinese edition)
☆135 Updated last year
mindspore-lab / mindformers
☆177 Updated this week
hyperai / vllm-cn
vLLM Documentation in Simplified Chinese
☆124 Updated last month
DeepLink-org / dlinfer
☆65 Updated last week
ninehills / llm-inference-benchmark
LLM Inference benchmark
☆430 Updated last year
flagos-ai / FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
☆407 Updated last week
alipay / PainlessInferenceAcceleration
Accelerate inference without tears
☆367 Updated last month
owenliang / qwen-vllm
Tongyi Qianwen (Qwen) vLLM inference deployment demo
☆620 Updated last year
alibaba / ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
☆437 Updated 3 weeks ago
Tencent / KsanaLLM
☆512 Updated 2 months ago
AI-Study-Han / Zero-Qwen-VL
Trains a LLaVA model with better Chinese support and open-sources the training code and data.
☆76 Updated last year
Glanvery / LLM-Travel
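Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to deeply understanding, discussing, and implementing the techniques, principles, and applications related to large models.
☆352 Updated last year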
InternLM / InternEvo
InternEvo is an open-sourced lightweight training framework that aims to support model pre-training without the need for extensive dependencie…
☆411 Updated 3 months ago
wangwenju269 / work_space
NLP project record archive
☆61 Updated 7 months ago
sunkx109 / llama
Inference code for LLaMA models
☆127 Updated 2 years ago
hengjiUSTC / learn-llm
☆115 Updated last year
Chinese-Tiny-LLM / Chinese-Tiny-LLM
☆235 Updated last year
inferflow / inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
☆249 Updated last year
qingkelab / qingketalk
Qingke Talk (青稞Talk)
☆161 Updated last week
Tencent / AngelSlim
Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.
☆201 Updated last week
the-seeds / LLaMA-Factory-Doc
LLaMA Factory Document
☆154 Updated 2 weeks ago
flageval-baai / FlagEval
FlagEval is an evaluation toolkit for AI large foundation models.
☆339 Updated 6 months ago
david-xinyuwei / david-share
☆368 Updated this week
IEIT-Yuan / Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
☆192 Updated last year
AI-Study-Han / Zero-Chatgpt
Walks through the ChatGPT technical pipeline from scratch.
☆267 Updated last year
zhanshijinwat / Steel-LLM
Train a 1B LLM on 1T tokens from scratch as an individual
☆753 Updated 6 months ago
NascentCore / llm-numbers-cn
Chinese edition of llm-numbers
☆126 Updated last year
QwenLM / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆138 Updated 11 months ago
jiahe7ay / MINI_LLM
A repository for individuals to experiment with and reproduce the LLM pre-training process.
☆476 Updated 6 months ago