lix19937 / tensorrt-insight
Deep insight into TensorRT, including but not limited to QAT, PTQ, plugins, triton_inference, and CUDA
☆16 Updated 2 weeks ago
Alternatives and similar repositories for tensorrt-insight:
Users who are interested in tensorrt-insight are comparing it to the libraries listed below.
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration ☆57 Updated 10 months ago
- Common libraries for PPL projects ☆29 Updated last month
- ☆36 Updated 6 months ago
- ☆17 Updated 4 years ago
- A tool to convert a TensorRT engine/plan to a fake ONNX model ☆38 Updated 2 years ago
- Six major CUDA parallel computing patterns: code and notes ☆60 Updated 4 years ago
- ☆24 Updated 2 years ago
- ☆16 Updated last year
- ☆17 Updated last year
- A ROS 1/ROS 2 hybrid package wrapping the Apache TVM project. ☆10 Updated 2 years ago
- A simple neural network inference framework ☆24 Updated last year
- study of cutlass ☆21 Updated 5 months ago
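- EasyNN is a neural network inference framework developed for teaching, designed so that anyone, even with no prior background, can write an inference framework on their own! ☆27 Updated 8 months ago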
- Open deep learning compiler stack for cpu, gpu and specialized accelerators ☆10 Updated 3 years ago
- Python C++ Code Manager ☆14 Updated 6 months ago
- This is a demo of how to write a high-performance convolution that runs on Apple silicon ☆54 Updated 3 years ago
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API ☆30 Updated last year
- CVFusion is an open-source deep learning compiler to fuse the OpenCV operators. ☆29 Updated 2 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency ☆108 Updated 7 months ago
- ☆25 Updated 3 years ago
- TVM learning and research ☆13 Updated 4 years ago
- edge/mobile transformer based Vision DNN inference benchmark ☆16 Updated 3 months ago
- SGEMM optimization with cuda step by step ☆18 Updated last year
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer ☆91 Updated 3 weeks ago
- Yet another Polyhedra Compiler for DeepLearning ☆19 Updated 2 years ago
- This is a repository to practice multi-thread programming in C++ ☆24 Updated last year
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀 ☆14 Updated last year
- ☆20 Updated 4 years ago
- YOLOv5 on Orin DLA ☆198 Updated last year
- symmetric int8 gemm ☆67 Updated 4 years ago