pandada8 / llm-inference-benchmarkLinks
LLM 推理服务性能测试
☆44Updated last year
Alternatives and similar repositories for llm-inference-benchmark
Users that are interested in llm-inference-benchmark are comparing it to the libraries listed below
Sorting:
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆270Updated 4 months ago
- LLM101n: Let's build a Storyteller 中文版☆135Updated last year
- ☆66Updated last week
- ☆179Updated last week
- LLM Inference benchmark☆431Updated last year
- 通义千问VLLM推理部署DEMO☆627Updated last year
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆132Updated 2 weeks ago
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关 的各种技术、原理和应用。☆355Updated last year
- Inference code for LLaMA models☆128Updated 2 years ago
- Accelerate inference without tears☆370Updated 3 weeks ago
- FlagScale is a large model toolkit based on open-sourced projects.☆421Updated last week
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆480Updated 7 months ago
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆415Updated 3 months ago
- ☆235Updated last year
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆59Updated last year
- ☆515Updated 3 weeks ago
- ☆360Updated last year
- A flexible and efficient training framework for large-scale alignment tasks☆442Updated last month
- 一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR☆194Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆139Updated last year
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆213Updated last year
- qwen models finetuning☆104Updated 9 months ago
- TinyRAG☆378Updated 5 months ago
- LLaMA Factory Document☆159Updated last week
- Imitate OpenAI with Local Models☆89Updated last year
- Mixture-of-Experts (MoE) Language Model☆192Updated last year
- ☆51Updated 2 years ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆250Updated last year
- ☆371Updated last week
- a toolkit on knowledge distillation for large language models☆218Updated last month