TensorRT encapsulation, learn, rewrite, practice.
☆29Oct 19, 2022Updated 3 years ago
Alternatives and similar repositories for trt_learn
Users that are interested in trt_learn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- async inference for machine learning model☆26Sep 21, 2022Updated 3 years ago
- a plugin-oriented framework for video structured. 国产程序员请加微信zhzhi78拉群交流。☆18May 28, 2024Updated last year
- h264的软解和硬解,基于FFmpeg和MPP☆11Mar 23, 2022Updated 4 years ago
- Awesome code, projects, books, etc. related to CUDA☆31Feb 3, 2026Updated last month
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆82May 26, 2025Updated 9 months ago
- ffmpeg+cuvid+tensorrt+multicamera☆11Dec 31, 2024Updated last year
- A practical way of learning Swizzle☆37Feb 3, 2025Updated last year
- 对 tensorRT_Pro 开源项目理解☆21Feb 23, 2023Updated 3 years ago
- 跟着Tensorrt_pro学习各种知识☆38Nov 25, 2022Updated 3 years ago
- This project provides simple code and demonstrates how to use the TensorRT C++ API and ONNX to deploy PaddleOCR text recognition model.☆51Aug 5, 2022Updated 3 years ago
- This is a repository to practice multi-thread programming in C++☆27Feb 21, 2024Updated 2 years ago
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 7 months ago
- Step-by-step optimization of CUDA SGEMM☆445Mar 30, 2022Updated 3 years ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- ☆20Jul 20, 2022Updated 3 years ago
- ☆23Aug 14, 2024Updated last year
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆30Dec 21, 2024Updated last year
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- 使用 cutlass 实现 flash-attention 精简版,具有教学意义☆59Aug 12, 2024Updated last year
- TensorRT实现BiSeNetV1与BiSeNetV2部署☆20Apr 14, 2022Updated 3 years ago
- 记录yolov5的TensorRT量化及推理代码,经实测可运行于Jetson平台☆20May 11, 2023Updated 2 years ago
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆22Feb 4, 2025Updated last year
- Cute layout visualization☆33Jan 18, 2026Updated 2 months ago
- 使用 CUDA C++ 实现的 llama 模型推理框架☆63Nov 8, 2024Updated last year
- ☆13Nov 3, 2025Updated 4 months ago
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆81Aug 12, 2024Updated last year
- a simple lightweight large language model pipeline framework.☆28Apr 25, 2025Updated 10 months ago
- rknn inference☆48Mar 7, 2022Updated 4 years ago
- Deep Learning Deployment Framework: Supports tf/torch/trt/trtllm/vllm and other NN frameworks. Support dynamic batching, and streaming mo…☆168May 8, 2025Updated 10 months ago
- ☆61Sep 13, 2025Updated 6 months ago
- ☆114Mar 11, 2024Updated 2 years ago
- ☆32Jul 2, 2025Updated 8 months ago
- This project is the Torch implementation of our ICCV 2017 paper: Centered Weight Normalization in Accelerating Training of Deep Neural…☆21Dec 7, 2019Updated 6 years ago
- A CUDA kernel for NHWC GroupNorm for PyTorch☆23Nov 15, 2024Updated last year
- 本仓库在OpenVINO推理框架下部署Nanodet检测算法,并重写预处理和后处理部分,具有超高性能!让你在Intel CPU平台上的检测速度起飞! 并基于NNCF和PPQ工具将模型量化(PTQ)至int8精度,推理速度更快!☆16Jun 14, 2023Updated 2 years ago
- ☆49Apr 15, 2024Updated last year
- bluetooth gyroscopic mouse like a wii remote control☆14Oct 17, 2021Updated 4 years ago
- Official implementation for Sparse MetA-Tuning (SMAT)☆17Jul 29, 2025Updated 7 months ago
- A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt☆170Jul 24, 2022Updated 3 years ago