aadhithya / onnx-typecast
Script to typecast ONNX model parameters from INT64 to INT32.
☆92 · Updated 4 months ago
Related projects:
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms. ☆71 · Updated 2 months ago
- A PyTorch-to-TensorRT converter with dynamic shape support. ☆255 · Updated 7 months ago
- Converting weights of PyTorch models to ONNX and TensorRT engines. ☆46 · Updated last year
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op… ☆266 · Updated 4 months ago
- NVIDIA-Alibaba 2021 TRT competition `second prize` code submission. Team: 美迪康 AI Lab ☆161 · Updated 2 years ago
- A toolkit to help optimize large ONNX models. ☆135 · Updated 4 months ago
- ONNX Runtime Inference C++ Example ☆218 · Updated last year
- Count number of parameters / MACs / FLOPS for ONNX models. ☆84 · Updated 2 years ago
- This repository provides a YOLOv5 GPU optimization sample. ☆100 · Updated last year
- ☆77 · Updated last year
- A toolkit to help optimize large ONNX models. ☆53 · Updated this week
- Utility scripts for editing or modifying ONNX models. Utility scripts to summarize ONNX model files along with visualization for loop ope… ☆79 · Updated 3 years ago
- Quantization Aware Training ☆53 · Updated 8 months ago
- This project aims to explore the deployment of Swin-Transformer based on TensorRT, including the test results of FP16 and INT8. ☆160 · Updated last year
- ☆27 · Updated last year
- This is an 8-bit quantization sample for YOLOv5. PTQ, QAT and Partial Quantization have all been implemented, and present the results based… ☆95 · Updated 2 years ago
- This repository serves as an example of deploying the YOLO models on Triton Server for performance and testing purposes. ☆38 · Updated 3 months ago
- A simple, fully convolutional model for real-time instance segmentation. ☆43 · Updated 4 years ago
- ☆105 · Updated last year
- Using TensorRT for Inference Model Deployment. ☆46 · Updated 8 months ago
- Accelerate Segment Anything Model inference using TensorRT 8.6.1.6. ☆78 · Updated 10 months ago
- Triton server ensemble model demo ☆30 · Updated 2 years ago
- A tool to convert a TensorRT engine/plan to a fake ONNX ☆37 · Updated last year
- A simple tool that can generate TensorRT plugin code quickly. ☆216 · Updated last year
- Useful TensorRT plugin for PyTorch and MMDetection model conversion. ☆155 · Updated 7 months ago
- Implement popular deep learning networks in PyTorch, used by tensorrtx. ☆187 · Updated 2 years ago
- A parser, editor and profiler tool for ONNX models. ☆379 · Updated 3 weeks ago
- This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server. ☆278 · Updated 2 years ago
- YOLOv5 TensorRT Implementations ☆67 · Updated last year
- This repository provides an optical character detection and recognition solution optimized on NVIDIA devices. ☆52 · Updated 3 months ago