pandada8 / llm-inference-benchmarkLinks
LLM 推理服务性能测试
☆44Updated last year
Alternatives and similar repositories for llm-inference-benchmark
Users that are interested in llm-inference-benchmark are comparing it to the libraries listed below
Sorting:
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆267Updated 2 months ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆115Updated 2 weeks ago
- LLM101n: Let's build a Storyteller 中文版☆135Updated last year
- ☆175Updated this week
- ☆64Updated last week
- 通义千问VLLM推理部署DEMO☆614Updated last year
- LLM Inference benchmark☆428Updated last year
- ☆115Updated 11 months ago
- ☆234Updated last year
- ☆360Updated this week
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆347Updated last year
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆74Updated last year
- ☆508Updated last month
- FlagScale is a large model toolkit based on open-sourced projects.☆364Updated last week
- Train a 1B LLM with 1T tokens from scratch by personal☆741Updated 6 months ago
- 一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR☆191Updated 2 months ago
- Accelerate inference without tears☆361Updated 2 weeks ago
- TinyRAG☆355Updated 4 months ago
- A flexible and efficient training framework for large-scale alignment tasks☆433Updated last week
- Inference code for LLaMA models☆127Updated 2 years ago
- 使用单个24G显卡,从0开始训练LLM☆56Updated 3 months ago
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆473Updated 5 months ago
- LLM Tokenizer with BPE algorithm☆43Updated last year
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆58Updated last year
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆71Updated 8 months ago
- Alpaca Chinese Dataset -- 中文指令微调数据集☆217Updated last year
- 从0开始,将chatgpt的技术路线跑一遍。☆264Updated last year
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆411Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆138Updated 10 months ago
- ☆51Updated 2 years ago