Tlntin / Qwen-TensorRT-LLM
☆623Updated last year
Alternatives and similar repositories for Qwen-TensorRT-LLM
Users who are interested in Qwen-TensorRT-LLM are comparing it to the libraries listed below.
- ☆28Updated last year
- Accelerate inference without tears☆370Updated last month
- ☆180Updated 2 weeks ago
- ☆90Updated 2 years ago
- llm-export can export LLM models to ONNX.☆336Updated last month
- Inference code for LLaMA models☆128Updated 2 years ago
- export llama to onnx☆137Updated 11 months ago
- Tongyi Qianwen (Qwen) vLLM inference deployment demo☆633Updated last year
- LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)☆443Updated 2 years ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆270Updated 4 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆945Updated this week
- ☆517Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆139Updated last year
- a lightweight LLM model inference framework☆744Updated last year
- CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NL…☆392Updated 2 years ago
- PaddlePaddle custom device implementation (custom hardware integration for PaddlePaddle).☆101Updated this week
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆483Updated 7 months ago
- Optimize QWen1.5 models with TensorRT-LLM☆17Updated last year
- LLM deployment project based on MNN. This project has been merged into MNN.☆1,613Updated 11 months ago
- LLM101n: Let's build a Storyteller (Chinese edition)☆136Updated last year
- LLM Inference benchmark☆430Updated last year
- Transformer related optimization, including BERT, GPT☆39Updated 2 years ago
- Best practice for training LLaMA models in Megatron-LM☆664Updated last year
- Train a Chinese vocabulary with BPE in sentencepiece and use it in transformers.☆120Updated 2 years ago
- C++ implementation of Qwen-LM☆611Updated last year
- Collection of Chinese Books☆201Updated last year
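- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to understanding, discussing, and implementing the technologies, principles, and applications related to large models.☆357Updated last year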
- Line-by-line annotated version of the Baichuan2 code, suitable for beginners☆213Updated 2 years ago
- Multi-GPU ChatGLM using deepspeed and…☆411Updated last year
- ☆51Updated 2 years ago