owenliang / mnist-onnx-runtimeLinks
MoE model with onnx runtime
☆49Updated last year
Alternatives and similar repositories for mnist-onnx-runtime
Users that are interested in mnist-onnx-runtime are comparing it to the libraries listed below
Sorting:
- LLM Tokenizer with BPE algorithm☆33Updated last year
- 大模型/LLM推理和部署理论与实践☆308Updated last month
- 人工智能培训课件资源☆107Updated last week
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆92Updated 3 months ago
- 一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR☆186Updated 2 weeks ago
- 将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调☆226Updated 2 weeks ago
- ☆51Updated 5 months ago
- 从零到一实现一个 miniLLM~(动手学习LLM)☆76Updated last year
- 通义千问VLLM推理部署DEMO☆594Updated last year
- DeepSpeed Tutorial☆101Updated last year
- LLM101n: Let's build a Storyteller 中文版☆132Updated last year
- 通义千问的DPO训练☆52Updated 10 months ago
- 模型压缩的小白入门教程,PDF下载地址 https://github.com/datawhalechina/awesome-compression/releases☆312Updated 2 months ago
- LLM 推理服务性能测试☆44Updated last year
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆65Updated 11 months ago
- 关于Transformer模型的最简洁pytorch实现,包含详细注释☆211Updated last year
- 看图学大模型☆316Updated last year
- ☆96Updated 5 months ago
- TinyRAG☆319Updated last month
- ☆50Updated 9 months ago
- ☆172Updated this week
- 《自然语言处理:大模型理论与实践》配套数据和代码☆68Updated 7 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆65Updated last year
- 从0开始,将chatgpt的技术路线跑一遍。☆250Updated 11 months ago
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆334Updated last year
- pretrain a wiki llm using transformers☆49Updated 11 months ago
- ☆95Updated 10 months ago
- ☆65Updated last year
- ☆45Updated last year
- qwen ai agent☆136Updated last year