Tlntin / Qwen-TensorRT-LLM
☆624 Updated last year
Alternatives and similar repositories for Qwen-TensorRT-LLM
Users who are interested in Qwen-TensorRT-LLM compare it to the libraries listed below.
- Accelerate inference without tears ☆370 Updated 3 weeks ago
- llm-export can export LLM models to ONNX. ☆334 Updated last month
- ☆180 Updated last week
- Qwen (Tongyi Qianwen) vLLM inference deployment demo ☆627 Updated last year
- ☆28 Updated last year
- Export LLaMA to ONNX ☆137 Updated 11 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications. ☆940 Updated this week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including … ☆270 Updated 4 months ago
- ☆90 Updated 2 years ago
- ☆515 Updated 3 weeks ago
- C++ implementation of Qwen-LM ☆610 Updated last year
- ☆52 Updated last year
- Best practice for training LLaMA models in Megatron-LM ☆663 Updated last year
- LLM Inference benchmark ☆431 Updated last year
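- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to understanding, discussing, and implementing the technologies, principles, and applications related to large models. ☆355 Updated last year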
- Inference code for LLaMA models ☆128 Updated 2 years ago
- This is a repository used by individuals to experiment with and reproduce the pre-training process of LLMs. ☆480 Updated 7 months ago
- A lightweight LLM inference framework ☆744 Updated last year
- A line-by-line annotated walkthrough of the Baichuan2 code, suitable for beginners ☆214 Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆139 Updated last year
- LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA) ☆443 Updated 2 years ago
- OpenLLMWiki: Docs of OpenLLMAI. Survey, reproduction and domain/task adaptation of open source chatgpt alternatives/implementations. PiXi… ☆262 Updated last year
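- LLM inference service performance testing ☆44 Updated last year
- Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for langchain integration to load a local knowledge base for retrieval-augmented generation (RAG). ☆579 Updated last year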
- Optimize QWen1.5 models with TensorRT-LLM ☆17 Updated last year
- "ChatGPT Principles and Practice: Algorithms, Techniques, and Private Deployment of Large Language Models" ☆369 Updated 2 years ago
- Chinese Mixtral mixture-of-experts (MoE) LLMs ☆609 Updated last year
- The Triton TensorRT-LLM Backend ☆910 Updated 2 weeks ago
- Train a Chinese vocabulary with BPE in sentencepiece and use it in transformers. ☆120 Updated 2 years ago
- A curated collection of Chinese books ☆201 Updated last year