Tlntin / Qwen-TensorRT-LLM
☆585Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for Qwen-TensorRT-LLM
- ☆291Updated 4 months ago
- llm-export can export llm model to onnx.☆231Updated last week
- ☆290Updated last week
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆547Updated last month
- ☆145Updated this week
- 通义千问VLLM推理部署DEMO☆446Updated 7 months ago
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆262Updated this week
- Best practice for training LLaMA models in Megatron-LM☆628Updated 10 months ago
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆723Updated this week
- C++ implementation of Qwen-LM☆554Updated 10 months ago
- ☆90Updated last year
- Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型☆399Updated last year
- LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)☆374Updated last year
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆269Updated 4 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs