Tlntin / Qwen-TensorRT-LLM
☆626 Updated last year
Alternatives and similar repositories for Qwen-TensorRT-LLM
Users that are interested in Qwen-TensorRT-LLM are comparing it to the libraries listed below
- Accelerate inference without tears ☆367 Updated last month
- ☆27 Updated last year
- llm-export can export LLM models to ONNX. ☆328 Updated 3 weeks ago
- ☆177 Updated this week
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications. ☆926 Updated this week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including … ☆268 Updated 3 months ago
- Export LLaMA to ONNX ☆136 Updated 10 months ago
- ☆90 Updated 2 years ago
- Optimize Qwen1.5 models with TensorRT-LLM ☆17 Updated last year
- ☆512 Updated 2 months ago
- Best practice for training LLaMA models in Megatron-LM ☆659 Updated last year
- C++ implementation of Qwen-LM ☆606 Updated 11 months ago
- a lightweight LLM model inference framework ☆741 Updated last year
- LLM deployment project based on MNN. This project has been merged into MNN. ☆1,609 Updated 10 months ago
- Tongyi Qianwen (Qwen) vLLM inference deployment demo ☆620 Updated last year
- LLM Inference benchmark ☆430 Updated last year
- LLaMa/RWKV onnx models, quantization and testcase ☆367 Updated 2 years ago
- LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA) ☆440 Updated 2 years ago
- FlagScale is a large model toolkit based on open-sourced projects. ☆407 Updated last week
- OpenLLMWiki: Docs of OpenLLMAI. Survey, reproduction and domain/task adaptation of open source chatgpt alternatives/implementations. PiXi… ☆262 Updated 11 months ago
- ☆52 Updated last year
- Inference code for LLaMA models