inisis / OnnxSlimLinks

A Toolkit to Help Optimize Onnx Model

☆188

Alternatives and similar repositories for OnnxSlim

Users that are interested in OnnxSlim are comparing it to the libraries listed below

Sorting:

tsingmicro-toolchain / OnnxSlim
A Toolkit to Help Optimize Large Onnx Model
☆157Updated last year
wangzhaode / onnx-llm
llm deploy project based onnx.
☆42Updated 9 months ago
inisis / OnnxLLM
Large Language Model Onnx Inference Framework
☆36Updated 6 months ago
onnx / neural-compressor
Model compression for ONNX
☆97Updated 8 months ago
torchpipe / torchpipe
Serving Inside Pytorch
☆163Updated this week
levipereira / yolov9-qat
Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.
☆118Updated 3 months ago
FeiGeChuanShu / segment-anything-ncnn
an example of segment-anything infer by ncnn
☆123Updated 2 years ago
microsoft / onnxconverter-common
Common utilities for ONNX converters
☆276Updated 2 weeks ago
staghado / vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
☆289Updated last year
daquexian / faster-rwkv
☆124Updated last year
wangzhaode / mnn-yolo
mnn yolo demos.
☆78Updated 9 months ago
wangzhaode / mnn-stable-diffusion
stable diffusion using mnn
☆66Updated last year
ZHEQIUSHUI / SAM-ONNX-AX650-CPP
SAM and lama inpaint，包含QT的GUI交互界面，实现了交互式可实时显示结果的画点、画框进行SAM，然后通过进行Inpaint，具体操作看readme里的视频。
☆48Updated last year
leimao / ONNX-Runtime-Inference
ONNX Runtime Inference C++ Example
☆241Updated 4 months ago
MollySophia / rwkv-qualcomm
Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK
☆77Updated 3 weeks ago
luchangli03 / onnxsim_large_model
simplify >2GB large onnx model
☆61Updated 8 months ago
lrw04 / llama2.c-to-ncnn
A converter for llama2.c legacy models to ncnn models.
☆81Updated last year
aadhithya / onnx-typecast
Script to typecast ONNX model parameters from INT64 to INT32.
☆107Updated last year
PINTO0309 / simple-onnx-processing-tools
A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…
☆296Updated last year
triton-inference-server / tensorrt_backend
The Triton backend for TensorRT.
☆77Updated last week
pnnx / pnnx
PyTorch Neural Network eXchange
☆605Updated last week
FeiGeChuanShu / trt2023
NVIDIA TensorRT Hackathon 2023复赛选题：通义千问Qwen-7B用TensorRT-LLM模型搭建及优化
☆42Updated last year
DanielSarmiento04 / yolov10cpp
Implementation of yolo v10 in c++ std 17 over opencv and onnxruntime
☆87Updated 10 months ago
BaofengZan / GOT-OCRv2-onnx
用于学习GOT/Qwen/OnnxLLm
☆53Updated 9 months ago
NVIDIA-AI-IOT / NVIDIA-Optical-Character-Detection-and-Recognition-Solution
This repository provides optical character detection and recognition solution optimized on Nvidia devices.
☆76Updated 2 months ago
wangzhaode / llm-export
llm-export can export llm model to onnx.
☆301Updated 6 months ago
triple-Mu / TensorRT2ONNX
A tool convert TensorRT engine/plan to a fake onnx
☆41Updated 2 years ago
levipereira / triton-server-yolo
This repository serves as an example of deploying the YOLO models on Triton Server for performance and testing purposes
☆65Updated last year
daquexian / web-model-converter
☆41Updated 2 years ago
ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆447Updated this week