Tencent / TPAT
TensorRT Plugin Autogen Tool
☆367Updated last year
Related projects
Alternatives and complementary repositories for TPAT
- A simple tool that can generate TensorRT plugin code quickly.☆221Updated last year
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆493Updated 3 weeks ago
- ☆228Updated 2 years ago
- Deploy your model with TensorRT quickly.☆762Updated last year
- Useful TensorRT plugins for PyTorch and MMDetection model conversion.☆159Updated last month
- A parser, editor and profiler tool for ONNX models.☆400Updated this week
- Actively maintained ONNX Optimizer☆647Updated 8 months ago
- row-major matmul optimization☆591Updated last year
- DeepLearning Framework Performance Profiling Toolkit☆277Updated 2 years ago
- ☆990Updated 8 months ago
- A library for high performance deep learning inference on NVIDIA GPUs.☆547Updated 2 years ago
- A PyTorch-to-TensorRT converter with dynamic shape support☆257Updated 9 months ago
- TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillati…☆567Updated this week
- Server-side deep learning deployment examples☆453Updated 4 years ago
- Serving Inside Pytorch☆145Updated this week
- Benchmark for embedded-AI deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆203Updated 3 years ago
- Offline quantization tools for deployment.☆116Updated 10 months ago
- Model Quantization Benchmark☆765Updated 5 months ago
- Yinghan's Code Sample☆289Updated 2 years ago
- Inference of quantization aware trained networks using TensorRT☆79Updated last year
- ⚡ Useful scripts when using TensorRT☆240Updated 4 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆393Updated last year
- ☆138Updated 2 weeks ago
- A simple high performance CUDA GEMM implementation.☆335Updated 10 months ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆816Updated this week
- Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052☆457Updated 8 months ago
- ☆140Updated 6 months ago
- TensorRT 2022 second-round solution: TensorRT inference optimization of MST++, the first Transformer-based image reconstruction model☆135Updated 2 years ago
- MegCC is a deep learning model compiler with an ultra-lightweight runtime that is efficient and easy to port☆473Updated 3 weeks ago
- YOLOv5 on Orin DLA☆186Updated 9 months ago