Tencent / TPATLinks
TensorRT Plugin Autogen Tool
☆369Updated 2 years ago
Alternatives and similar repositories for TPAT
Users that are interested in TPAT are comparing it to the libraries listed below
Sorting:
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆506Updated 8 months ago
- A simple tool that can generate TensorRT plugin code quickly.☆232Updated 2 years ago
- Deploy your model with TensorRT quickly.☆767Updated last year
- A parser, editor and profiler tool for ONNX models.☆445Updated last month
- A library for high performance deep learning inference on NVIDIA GPUs.☆553Updated 3 years ago
- Useful tensorrt plugin. For pytorch and mmdetection model conversion.☆165Updated 9 months ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆485Updated 8 months ago
- Compiler Infrastructure for Neural Networks☆146Updated last year
- TensorRT 2022复赛方案: 首个基于Transformer的图像重建模型MST++的TensorRT模型推断优化☆139Updated 3 years ago
- ☆26Updated last year
- ☆139Updated last year
- ☆37Updated 9 months ago
- Offline Quantization Tools for Deploy.☆129Updated last year
- ☆288Updated 3 years ago
- Serving Inside Pytorch☆163Updated this week
- ☆1,031Updated last year
- row-major matmul optimization☆640Updated last year
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆875Updated 6 months ago
- 服务侧深度学习部署案例☆451Updated 5 years ago
- A primitive library for neural network☆1,344Updated 7 months ago
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆204Updated 4 years ago
- ⚡ Useful scripts when using TensorRT☆242Updated 4 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆400Updated 2 years ago
- Inference of quantization aware trained networks using TensorRT☆83Updated 2 years ago
- Model Quantization Benchmark☆820Updated 2 months ago
- Adlik: Toolkit for Accelerating Deep Learning Inference☆801Updated last year
- code reading for tvm☆76Updated 3 years ago
- Collection of blogs on AI development☆19Updated 8 months ago
- ONNX Optimizer☆727Updated last week
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆475Updated last year