owenliang / mnist-onnx-runtimeLinks
MoE model with onnx runtime
☆53Updated last year
Alternatives and similar repositories for mnist-onnx-runtime
Users that are interested in mnist-onnx-runtime are comparing it to the libraries listed below
Sorting:
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆106Updated 3 weeks ago
- 人工智能培训课件资源☆113Updated last week
- 一些大语言模型和多模态模型的生态,主 要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR☆190Updated last month
- 大模型/LLM推理和部署理论与实践☆341Updated 2 months ago
- 将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调☆365Updated 2 weeks ago
- LLM101n: Let's build a Storyteller 中文版☆132Updated last year
- LLM Tokenizer with BPE algorithm☆39Updated last year
- DeepSpeed Tutorial☆102Updated last year
- 《自然语言处理:大模型理论与实践》配套数据和代码☆68Updated last week
- LLM 推理服务性能测试☆44Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆67Updated last year
- TinyRAG☆341Updated 3 months ago
- A simple deep learning framework inspired by Dezero and PyTorch☆31Updated 8 months ago
- ☆101Updated 6 months ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
- ☆50Updated 10 months ago
- 通义千问VLLM推理部署DEMO☆608Updated last year
- ☆53Updated 6 months ago
- 关于Transformer模型的最简洁pytorch实现,包含详细注释☆214Updated last year
- 一些 LLM 方面的从零复现笔记☆221Updated 4 months ago
- 从0开始,将chatgpt的技术路线跑一遍。☆258Updated last year
- 从零到一实现一个 miniLLM~(动手学习LLM)☆76Updated last year
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆72Updated last year
- pretrain a wiki llm using transformers☆51Updated last year
- 模型压缩的小白入门教程,PDF下载地址 https://github.com/datawhalechina/awesome-compression/releases☆323Updated 3 months ago
- ☆103Updated last year
- 基于昇腾310芯片的大语言模型部署☆23Updated last year
- qwen ai agent☆139Updated last year
- 筱可的工程实验仓库!☆85Updated 2 weeks ago
- Qwen3 Fine-tuning: Medical R1 Style Chat☆180Updated 3 months ago