owenliang / mnist-onnx-runtime
MoE model with onnx runtime
☆48 Updated last year
Alternatives and similar repositories for mnist-onnx-runtime
Users who are interested in mnist-onnx-runtime are comparing it to the repositories listed below.
- LLM Tokenizer with BPE algorithm ☆33 Updated last year
- From MHA, MQA, GQA to MLA by 苏剑林, with code ☆23 Updated 5 months ago
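- vLLM documentation in Simplified Chinese ☆85 Updated 2 months ago
- AI training courseware resources ☆103 Updated last week
- Large model / LLM inference and deployment: theory and practice ☆293 Updated last week
- Train a LLaVA model with better Chinese support, with open-sourced training code and data. ☆64 Updated 10 months ago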
- ☆90 Updated 9 months ago
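- An ecosystem of large language models and multimodal models, mainly covering cross-modal search, speculative decoding, QAT quantization, multimodal quantization, ChatBot, and OCR ☆184 Updated last month
- LLM101n: Let's build a Storyteller (Chinese edition) ☆131 Updated 11 months ago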
- Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function ind… ☆98 Updated last year
- ☆44 Updated 11 months ago
- ☆94 Updated 4 months ago
- Inference code for LLaMA models ☆122 Updated last year
- LLM inference service performance testing ☆42 Updated last year
- ☆47 Updated 4 months ago
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge ☆122 Updated 8 months ago
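- Learn large models through pictures ☆314 Updated 11 months ago
- DPO training for Tongyi Qianwen (Qwen) ☆51 Updated 10 months ago
- The most concise PyTorch implementation of the Transformer model, with detailed comments ☆205 Updated last year
- unify-easy-llm (ULM) aims to provide a simple one-click large model training tool that supports different hardware such as Nvidia GPU and Ascend NPU, as well as commonly used large models. ☆56 Updated 11 months ago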
- TinyRAG ☆315 Updated 3 weeks ago
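- A beginner's tutorial on model compression; PDF download: https://github.com/datawhalechina/awesome-compression/releases ☆305 Updated last month
- Tongyi Qianwen (Qwen) vLLM inference deployment demo ☆589 Updated last year
- Notes on reproducing LLM-related work from scratch ☆209 Updated 2 months ago
- DeepSpeed tutorial, annotated examples, and study notes (efficient training of large models) ☆169 Updated last year
- Implement a miniLLM from zero to one (hands-on LLM learning) ☆75 Updated last year
- Run ChatGLM2-6B on BM1684X ☆49 Updated last year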
- ☆64 Updated last year
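- "Multimodal Large Models: A New Generation of AI Technology Paradigm" (《多模态大模型:新一代人工智能技术范式》), by 刘阳 and 林倞 ☆221 Updated 7 months ago
- Walk through ChatGPT's technical roadmap from scratch. ☆244 Updated 10 months ago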