NVIDIA / Deep-Learning-Accelerator-SW
NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.
☆193Updated 10 months ago
Alternatives and similar repositories for Deep-Learning-Accelerator-SW:
Users that are interested in Deep-Learning-Accelerator-SW are comparing it to the libraries listed below
- YOLOv5 on Orin DLA☆198Updated last year
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson☆326Updated 2 years ago
- A simple tool that can generate TensorRT plugin code quickly.☆230Updated last year
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆57Updated 10 months ago
- A parser, editor and profiler tool for ONNX models.☆425Updated 3 months ago
- Using pattern matcher in onnx model to match and replace subgraphs.☆78Updated last year
- Deep Learning tools and applications for NVIDIA AGX platforms.☆214Updated last week
- TensorRT Plugin Autogen Tool☆370Updated 2 years ago
- Collection of blogs on AI development☆19Updated 5 months ago
- Offline Quantization Tools for Deploy.☆127Updated last year
- This is 8-bit quantization sample for yolov5. Both PTQ, QAT and Partial Quantization have been implemented, and present the results based…☆102Updated 2 years ago
- Edge AI Software and Development Tools☆140Updated 3 weeks ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆107Updated 2 months ago
- CUDA Matrix Multiplication Optimization☆181Updated 9 months ago
- 该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。☆67Updated last year
- A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt☆131Updated 2 years ago
- TensorRT 7 C++ (almost) minimal examples☆80Updated last year
- A tutorial for CUDA&PyTorch☆132Updated 3 months ago
- ☆275Updated 2 years ago
- Common utilities for ONNX converters☆266Updated 4 months ago
- A Toolkit to Help Optimize Large Onnx Model☆153Updated 11 months ago
- nvidia-modelopt is a unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculat…☆870Updated this week
- This repository provides YOLOV5 GPU optimization sample☆102Updated 2 years ago
- ☆122Updated last year
- Inference of quantization aware trained networks using TensorRT☆80Updated 2 years ago
- ☆148Updated 3 months ago
- yolo model qat and deploy with deepstream&tensorrt☆568Updated 7 months ago
- Jetson embedded platform-target deep learning inference acceleration framework with TensorRT☆28Updated last month
- 使用 CUDA C++ 实现的 llama 模型推理框架☆49Updated 5 months ago
- Experimental projects related to TensorRT☆97Updated this week