Tlntin / Qwen-TensorRT-LLM
☆572Updated last month
Related projects:
- ☆284Updated 2 months ago
- GoMate: RAG Framework within Reliable input, Trusted output☆414Updated last week
- LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)☆350Updated 11 months ago
- llm-export can export LLM models to ONNX.☆193Updated this week
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆512Updated last week
- Tongyi Qianwen (Qwen) vLLM inference deployment demo☆402Updated 5 months ago
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆167Updated this week
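- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to understanding, discussing, and implementing the technologies, principles, and applications related to large models.☆246Updated 2 months ago
- This is a repository used by individuals to experiment with and reproduce the pre-training process of LLMs.☆328Updated 4 months ago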
- ☆251Updated last week
- LLM101n: Let's build a Storyteller (Chinese edition)☆113Updated last month
- Best practice for training LLaMA models in Megatron-LM☆606Updated 8 months ago
- ☆90Updated last year
- ☆23Updated 3 months ago
- CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NL…☆375Updated 10 months ago
- export llama to onnx☆91Updated 3 months ago
- ☆131Updated last week
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆649Updated last week
- Inference code for LLaMA models☆101Updated last year
- Unified, efficient fine-tuning of RAG retrieval, including Embedding, ColBERT, Cross Encoder☆430Updated last month
- C++ implementation of Qwen-LM☆531Updated 8 months ago
- Flask and FastAPI implementations of a ChatGLM-6B HTTP streaming decoding API, plus a ready-to-use web page. A stream decoding demo of ChatGLM-6B using Flask or FastAPI, with web page out-of-th…☆92Updated 7 months ago
- Alpaca Chinese Dataset -- Chinese instruction fine-tuning dataset [human + GPT4o, continuously updated]☆161Updated this week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆130Updated 3 weeks ago
- A curated collection of open-source SFT datasets, added to as needed☆413Updated last year
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch☆226Updated this week
- Firefly Chinese LLaMA-2 large model; supports incremental pre-training of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other large models☆396Updated 11 months ago
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.☆1,045Updated last month
- LLM Inference benchmark☆331Updated last month
- ☆247Updated 3 months ago