Tencent / TPATLinks
TensorRT Plugin Autogen Tool
☆367Updated 2 years ago
Alternatives and similar repositories for TPAT
Users that are interested in TPAT are comparing it to the libraries listed below
Sorting:
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆514Updated last year
- A simple tool that can generate TensorRT plugin code quickly.☆238Updated 2 years ago
- A parser, editor and profiler tool for ONNX models.☆473Updated 2 months ago
- A library for high performance deep learning inference on NVIDIA GPUs.☆558Updated 3 years ago
- Deploy your model with TensorRT quickly.☆765Updated 2 years ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆489Updated last year
- Inference of quantization aware trained networks using TensorRT☆82Updated 2 years ago
- Compiler Infrastructure for Neural Networks☆147Updated 2 years ago
- ☆27Updated 2 years ago
- Useful tensorrt plugin. For pytorch and mmdetection model conversion.☆165Updated last year
- Offline Quantization Tools for Deploy.☆141Updated 2 years ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆912Updated last year
- Collection of blogs on AI development☆21Updated last year
- TensorRT 2022复赛方案: 首个基于Transformer的图像重建模型MST++的TensorRT模型推断优化☆143Updated 3 years ago
- ☆141Updated last year
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆407Updated 3 years ago
- A primitive library for neural network☆1,369Updated last year
- ⚡ Useful scripts when using TensorRT☆237Updated 5 years ago
- A sample for onnxparser working with trt user defined plugins for TRT7.0☆171Updated 5 years ago
- ☆308Updated 3 years ago
- Model Quantization Benchmark☆855Updated 8 months ago
- Serving Inside Pytorch☆169Updated last week
- ☆38Updated last year
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆204Updated 4 years ago
- row-major matmul optimization☆698Updated 4 months ago
- ONNX2Pytorch☆165Updated 4 years ago
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆476Updated last year
- ONNX Optimizer☆787Updated this week
- ☆60Updated last year
- A model compression and acceleration toolbox based on pytorch.☆333Updated 2 years ago