wangshuai09 / vllmLinks
A high-throughput and memory-efficient inference and serving engine for LLMs
☆39Updated 3 months ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- ☆169Updated last year
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆67Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆139Updated last year
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆253Updated last year
- LLaMA Factory Document☆159Updated last week
- ☆180Updated last week
- ☆84Updated 2 years ago
- Imitate OpenAI with Local Models☆89Updated last year
- 中文预训练ModernBert☆94Updated 8 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆270Updated 4 months ago
- 大语言模型指令调优工具(支持 FlashAttention)☆178Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Updated 2 years ago
- Mixture-of-Experts (MoE) Language Model☆192Updated last year
- ☆29Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆45Updated last year
- 大模型预训练中文语料清洗及质量评估 Large model pre-training corpus cleaning☆72Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆67Updated 2 years ago
- code for piccolo embedding model from SenseTime☆143Updated last year
- this repo is mnbvc text quality classification using fastText☆16Updated 2 years ago
- ☆181Updated 2 years ago
- a-m-team's exploration in large language modeling☆194Updated 6 months ago
- Light local website for displaying performances from different chat models.☆87Updated 2 years ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆132Updated 3 weeks ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆84Updated last year
- 怎么训练一个LLM分词器☆154Updated 2 years ago
- 使用单个24G显卡,从0开始训练LLM☆55Updated 5 months ago
- ☆235Updated last year
- A more efficient GLM implementation!☆54Updated 2 years ago
- 更纯粹、更高压缩率的Tokenizer☆486Updated last year