Tlntin / Qwen-TensorRT-LLM
☆608Updated 9 months ago
Alternatives and similar repositories for Qwen-TensorRT-LLM
Users that are interested in Qwen-TensorRT-LLM are comparing it to the libraries listed below
Sorting:
- Accelerate inference without tears☆314Updated 2 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆718Updated 3 months ago
- llm-export can export llm model to onnx.☆286Updated 3 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆245Updated this week
- 通义千问VLLM推理部署DEMO☆577Updated last year
- ☆329Updated 3 months ago
- [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a V…☆472Updated this week
- LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)☆414Updated last year
- Best practice for training LLaMA models in Megatron-LM☆650Updated last year
- ☆90Updated last year
- ☆48Updated last week
- ☆162Updated last month
- Community maintained hardware plugin for vLLM on Ascend☆605Updated this week
- ☆127Updated 4 months ago
- ☆27Updated 6 months ago
- LLM Inference benchmark☆417Updated 9 months ago
- FlagScale is a large model toolkit based on open-sourced projects.☆270Updated this week
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆320Updated 9 months ago
- export llama to onnx☆124Updated 4 months ago
- Baichuan2代码的逐行解析版本,适合小白☆213Updated last year
- ☆308Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆135Updated 5 months ago
- 开源SFT数据集整理,随时补充☆513Updated last year
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆431Updated last week
- C++ implementation of Qwen-LM☆587Updated 5 months ago
- Inference code for LLaMA models☆120Updated last year
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆386Updated this week
- ☆139Updated last year
- ☆84Updated last year
- Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型☆410Updated last year