tsingmicro-toolchain / OnnxSlim
A Toolkit to Help Optimize Large Onnx Model
☆163Updated 3 months ago
Alternatives and similar repositories for OnnxSlim
Users who are interested in OnnxSlim are comparing it to the libraries listed below.
- Serving Inside Pytorch☆170Updated last week
- Large Language Model Onnx Inference Framework☆36Updated 2 months ago
- Simplify large (>2GB) ONNX models☆70Updated last year
- ☆125Updated 2 years ago
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing Tongyi Qianwen Qwen-7B with TensorRT-LLM☆43Updated 2 years ago
- LLM deployment project based on ONNX.☆49Updated last year
- Offline quantization tools for deployment.☆142Updated 2 years ago
- ☆103Updated 4 years ago
- An example of segment-anything inference with ncnn☆123Updated 2 years ago
- Converter from MegEngine to other frameworks☆70Updated 2 years ago
- Stable Diffusion using MNN☆67Updated 2 years ago
- TensorRT 2022 second-round solution: TensorRT inference optimization of MST++, the first Transformer-based image reconstruction model☆143Updated 3 years ago
- PyTorch Neural Network eXchange☆675Updated last week
- A Toolkit to Help Optimize Onnx Model☆383Updated this week
- A parser, editor and profiler tool for ONNX models.☆478Updated 3 months ago
- A converter for llama2.c legacy models to ncnn models.☆79Updated 2 years ago
- TensorRT encapsulation: learn, rewrite, practice.☆29Updated 3 years ago
- ☆26Updated 2 years ago
- ☆120Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Updated 2 weeks ago
- ☢️ TensorRT 2023 second round: Llama inference acceleration and optimization based on TensorRT-LLM☆51Updated 2 years ago
- Count number of parameters / MACs / FLOPS for ONNX models.☆95Updated last year
- Compare multiple optimization methods on Triton to improve model service performance☆52Updated 2 years ago
- ☆70Updated 3 years ago
- Utility scripts for editing or modifying onnx models. Utility scripts to summarize onnx model files along with visualization for loop ope…☆80Updated 4 years ago
- A simple tool that can generate TensorRT plugin code quickly.☆239Updated 2 years ago
- ONNX2Pytorch☆165Updated 4 years ago
- ☆43Updated 3 years ago
- Common utilities for ONNX converters☆293Updated last month
- C++ Helper Class for Deep Learning Inference Frameworks: TensorFlow Lite, TensorRT, OpenCV, OpenVINO, ncnn, MNN, SNPE, Arm NN, NNabla, ON…☆298Updated 3 years ago