wangshuai09 / vllmLinks
A high-throughput and memory-efficient inference and serving engine for LLMs
☆38Updated 6 months ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- ☆169Updated this week
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆61Updated 8 months ago
- ☆30Updated 10 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆136Updated 7 months ago
- ☆83Updated last year
- Imitate OpenAI with Local Models☆87Updated 10 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆259Updated last month
- Alpaca Chinese Dataset -- 中文指令微调数据集☆208Updated 9 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Updated last year
- 大语言模型指令调优工具(支持 FlashAttention)☆174Updated last year
- ☆172Updated last year
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆244Updated 8 months ago
- ☆230Updated last year
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆94Updated last year
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆56Updated 11 months ago
- qwen models finetuning☆100Updated 4 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆84Updated 2 months ago
- 更纯粹、更高压缩率的Tokenizer☆480Updated 7 months ago
- Accelerate inference without tears☆319Updated 4 months ago
- 文本去重☆74Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆66Updated 2 years ago
- 中文书籍收录整理, Collection of Chinese Books☆189Updated last year
- ☆148Updated last year
- Mixture-of-Experts (MoE) Language Model☆189Updated 10 months ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆243Updated last year
- A flexible and efficient training framework for large-scale alignment tasks☆388Updated this week
- LLM Inference benchmark☆422Updated 11 months ago
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆328Updated 11 months ago
- 中文预训练ModernBert☆75Updated 3 months ago