wangshuai09 / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆36Updated 2 months ago
Alternatives and similar repositories for vllm:
Users that are interested in vllm are comparing it to the libraries listed below
- A high-throughput and memory-efficient inference and serving engine for LLMs☆132Updated 3 months ago
- Imitate OpenAI with Local Models☆88Updated 7 months ago
- 本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作☆60Updated 5 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- deep learning☆150Updated 3 weeks ago
- code for piccolo embedding model from SenseTime☆123Updated 10 months ago
- 大语言模型指令调优工具(支持 FlashAttention)☆171Updated last year
- accelerate generating vector by using onnx model☆15Updated last year
- ☆159Updated this week
- 中文原生检索增强生成测评基准☆115Updated 11 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆65Updated 2 years ago
- Mixture-of-Experts (MoE) Language Model☆185Updated 6 months ago
- A more efficient GLM implementation!☆55Updated 2 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- ☆84Updated last year
- A flexible and efficient training framework for large-scale alignment tasks☆333Updated last month
- LLaMA Factory Document☆113Updated 3 weeks ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆247Updated 3 months ago
- ☆166Updated last year
- “悟道”数据☆41Updated 3 years ago
- Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.☆95Updated last year
- ☆225Updated 10 months ago
- Light local website for displaying performances from different chat models.☆85Updated last year
- Alpaca Chinese Dataset -- 中文指令微调数据集☆193Updated 5 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆72Updated last year
- 怎么训练一个LLM分词器☆142Updated last year
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆237Updated 5 months ago
- 中文 Instruction tuning datasets☆129Updated 11 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated 11 months ago
- 文本去重☆69Updated 10 months ago