owenliang / mnist-onnx-runtimeLinks

MoE model with onnx runtime

☆48

Alternatives and similar repositories for mnist-onnx-runtime

Users that are interested in mnist-onnx-runtime are comparing it to the libraries listed below

Sorting:

owenliang / bpe-tokenizer
LLM Tokenizer with BPE algorithm
☆33Updated last year
preacher-1 / MLA_tutorial
from MHA, MQA, GQA to MLA by 苏剑林, with code
☆23Updated 5 months ago
hyperai / vllm-cn
vLLM Documentation in Chinese Simplified / vLLM 中文文档
☆85Updated 2 months ago
git-cloner / ai-course
人工智能培训课件资源
☆103Updated last week
datawhalechina / llm-deploy
大模型/LLM推理和部署理论与实践
☆293Updated last week
AI-Study-Han / Zero-Qwen-VL
训练一个对中文支持更好的LLaVA模型，并开源训练代码和数据。
☆64Updated 10 months ago
RethinkFun / trian_ppo
☆90Updated 9 months ago
LDLINGLINGLING / adan_application
一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR
☆184Updated last month
SmartFlowAI / LLM101n-CN
LLM101n: Let's build a Storyteller 中文版
☆131Updated 11 months ago
AIFlowPlayer / LMDeploy-Jetson
Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function ind…
☆98Updated last year
lansinuote / Simple_LLM_PPO
☆44Updated 11 months ago
SmartFlowAI / TheGodOfCookery
☆94Updated 4 months ago
sunkx109 / llama
Inference code for LLaMA models
☆122Updated last year
pandada8 / llm-inference-benchmark
LLM 推理服务性能测试
☆42Updated last year
owenliang / DeepSeek-Distill-Qwen-For-Child
☆47Updated 4 months ago
liujunwen23 / MIRE
WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge
☆122Updated 8 months ago
chaoswork / llm_illustrated
看图学大模型
☆314Updated 11 months ago
owenliang / qwen-dpo
通义千问的DPO训练
☆51Updated 10 months ago
BoXiaolei / MyTransformer_pytorch
关于Transformer模型的最简洁pytorch实现，包含详细注释
☆205Updated last year
liguodongiot / unify-easy-llm
unify-easy-llm（ULM）旨在打造一个简易的一键式大模型训练工具，支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。
☆56Updated 11 months ago
KMnO4-zx / TinyRAG
TinyRAG
☆315Updated 3 weeks ago
datawhalechina / awesome-compression
模型压缩的小白入门教程，PDF下载地址 https://github.com/datawhalechina/awesome-compression/releases
☆305Updated last month
owenliang / qwen-vllm
通义千问VLLM推理部署DEMO
☆589Updated last year
Mxoder / LLM-from-scratch
一些 LLM 方面的从零复现笔记
☆209Updated 2 months ago
bobo0810 / LearnDeepSpeed
DeepSpeed教程 & 示例注释 & 学习笔记（大模型高效训练）
☆169Updated last year
bbruceyuan / LLMs-101
从零到一实现一个 miniLLM～（动手学习LLM）
☆75Updated last year
sophgo / ChatGLM2-TPU
run ChatGLM2-6B in BM1684X
☆49Updated last year
KMnO4-zx / xlab-huanhuan
☆64Updated last year
HCPLab-SYSU / Book-of-MLM
《多模态大模型：新一代人工智能技术范式》作者：刘阳，林倞
☆221Updated 7 months ago
AI-Study-Han / Zero-Chatgpt
从0开始，将chatgpt的技术路线跑一遍。
☆244Updated 10 months ago