Tlntin / Qwen-TensorRT-LLM
☆614Updated 11 months ago
Alternatives and similar repositories for Qwen-TensorRT-LLM
Users who are interested in Qwen-TensorRT-LLM are comparing it to the libraries listed below.
- Accelerate inference without tears☆319Updated 4 months ago
- ☆27Updated 8 months ago
- ☆169Updated this week
- ☆90Updated 2 years ago
- Export LLaMA to ONNX☆129Updated 6 months ago
- Tongyi Qianwen (Qwen) vLLM inference deployment demo☆586Updated last year
- llm-export can export LLM models to ONNX.☆299Updated 5 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆259Updated last month
- ☆455Updated last week
- LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)☆429Updated last year
- Inference code for LLaMA models☆122Updated last year
- LLM deployment project based on MNN. This project has been merged into MNN.☆1,597Updated 5 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆809Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆136Updated 7 months ago
- LLM Inference benchmark☆422Updated 11 months ago
- A lightweight LLM inference framework☆731Updated last year
- [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a V…☆510Updated last week
- ☆47Updated 8 months ago
- ChatGLM multi-GPU with DeepSpeed and …☆407Updated last year
- FlagScale is a large model toolkit based on open-sourced projects.☆321Updated this week
- Community maintained hardware plugin for vLLM on Ascend☆865Updated this week
- Train a Chinese vocabulary with BPE in sentencepiece and use it in transformers.☆118Updated 2 years ago
- CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NL…☆388Updated last year
- Best practice for training LLaMA models in Megatron-LM☆657Updated last year
- C++ implementation of Qwen-LM☆596Updated 7 months ago
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch☆385Updated this week
- Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for langchain to load a local knowledge base for retrieval-augmented generation (RAG).☆555Updated last year
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆1,200Updated last week
- Alpaca Chinese Dataset -- a Chinese instruction fine-tuning dataset☆208Updated 9 months ago
- A purer Tokenizer with a higher compression rate☆480Updated 7 months ago