owenliang / mnist-onnx-runtime
MoE model with onnx runtime
☆34Updated 10 months ago
Alternatives and similar repositories for mnist-onnx-runtime:
Users that are interested in mnist-onnx-runtime are comparing it to the libraries listed below
- LLM Tokenizer with BPE algorithm☆31Updated 10 months ago
- ☆30Updated 3 weeks ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
- Music large model based on InternLM2-chat.☆22Updated 3 months ago
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year
- DeepSpeed Tutorial☆95Updated 7 months ago
- 人工智能培训课件资源☆74Updated this week
- LLM 推理服务性能测试☆39Updated last year
- ☆40Updated 7 months ago
- A simple deep learning framework inspired by Dezero and PyTorch☆29Updated 2 months ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆55Updated 2 months ago
- Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function ind…☆89Updated last year
- Pytorch DDP Traning Demo☆20Updated 5 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆54Updated 6 months ago
- 通义千问的DPO训练☆40Updated 6 months ago
- simple decoder-only GTP model in pytorch☆37Updated 10 months ago
- ☆60Updated last year
- 模型压缩的小白入门教程☆22Updated 8 months ago
- LLM101n: Let's build a Storyteller 中文版☆130Updated 7 months ago
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆151Updated last year
- ☆39Updated 4 months ago
- ☆24Updated 2 months ago
- ☆50Updated 6 months ago
- ☆103Updated last year
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆46Updated last year
- 顾名思义:手搓的RAG☆121Updated last year
- 基于Amazon Bedrock的多模态AIGC童话绘本☆18Updated last year
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆55Updated 8 months ago
- Llama2 chinese finetuning☆38Updated last year
- ☆22Updated last month