Tencent / TPAT
TensorRT Plugin Autogen Tool
☆369 Updated 2 years ago
Alternatives and similar repositories for TPAT
Users who are interested in TPAT are comparing it to the repositories listed below
- A simple tool that can generate TensorRT plugin code quickly. ☆231 Updated last year
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms. ☆503 Updated 7 months ago
- A parser, editor and profiler tool for ONNX models. ☆436 Updated this week
- Deploy your model with TensorRT quickly. ☆768 Updated last year
- Useful TensorRT plugins for PyTorch and MMDetection model conversion. ☆165 Updated 7 months ago
- ⚡ Useful scripts when using TensorRT ☆242 Updated 4 years ago
- ☆281 Updated 3 years ago
- ☆1,026 Updated last year
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads. ☆871 Updated 5 months ago
- Serving Inside Pytorch ☆160 Updated 3 weeks ago
- ☆138 Updated last year
- ☆127 Updated 5 months ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description. ☆986 Updated 8 months ago
- Yinghan's Code Sample ☆329 Updated 2 years ago
- MegCC is a deep learning model compiler with an ultra-lightweight runtime, high efficiency, and easy portability. ☆485 Updated 7 months ago
- DeepLearning Framework Performance Profiling Toolkit ☆284 Updated 3 years ago
- A PyTorch to TensorRT converter with dynamic shape support ☆261 Updated last year
- Inference of quantization aware trained networks using TensorRT ☆81 Updated 2 years ago
- A library for high performance deep learning inference on NVIDIA GPUs. ☆552 Updated 3 years ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications. ☆197 Updated 11 months ago
- row-major matmul optimization ☆634 Updated last year
- A sample for onnxparser working with trt user defined plugins for TRT7.0 ☆168 Updated 4 years ago
- ONNX Optimizer ☆715 Updated this week
- Offline Quantization Tools for Deploy. ☆128 Updated last year
- TensorRT 2022 second-round solution: TensorRT inference optimization of MST++, the first Transformer-based image reconstruction model ☆139 Updated 2 years ago
- Collection of blogs on AI development ☆19 Updated 6 months ago
- Optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052 ☆473 Updated last year
- Model Quantization Benchmark ☆804 Updated last month
- A Toolkit to Help Optimize Large ONNX Models ☆158 Updated last year
- Compiler Infrastructure for Neural Networks ☆145 Updated last year