Tlntin / Qwen-TensorRT-LLMLinks
☆611Updated 10 months ago
Alternatives and similar repositories for Qwen-TensorRT-LLM
Users that are interested in Qwen-TensorRT-LLM are comparing it to the libraries listed below
Sorting:
- Accelerate inference without tears☆318Updated 3 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆801Updated 3 weeks ago
- ☆336Updated this week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆256Updated 3 weeks ago
- llm-export can export llm model to onnx.☆295Updated 5 months ago
- LLM Inference benchmark☆421Updated 11 months ago
- ☆168Updated this week
- LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)☆424Updated last year
- ☆27Updated 7 months ago
- export llama to onnx☆126Updated 5 months ago
- ☆128Updated 6 months ago
- Best practice for training LLaMA models in Megatron-LM☆656Updated last year
- FlagScale is a large model toolkit based on open-sourced projects.☆307Updated this week
- Baichuan2代码的逐行解析版本,适合小白☆214Updated last year
- C++ implementation of Qwen-LM☆594Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆135Updated 6 months ago
- 通义千问VLLM推理部署DEMO☆581Updated last year
- ☆90Updated last year
- OpenLLMWiki: Docs of OpenLLMAI. Survey, reproduction and domain/task adaptation of open source chatgpt alternatives/implementations. PiXi…☆261Updated 6 months ago
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆1,151Updated 2 weeks ago
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆441Updated last month
- [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a V…☆492Updated this week
- 一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测,低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。☆217Updated last year
- A flexible and efficient training framework for large-scale alignment tasks☆384Updated this week
- Train a 1B LLM with 1T tokens from scratch by personal☆679Updated last month
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆474Updated last year
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆328Updated 11 months ago
- Inference code for LLaMA models☆121Updated last year
- ☆51Updated last week
- Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型☆411Updated last year