NVIDIA / Deep-Learning-Accelerator-SW
NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.
☆190 Updated 9 months ago
Alternatives and similar repositories for Deep-Learning-Accelerator-SW:
Users that are interested in Deep-Learning-Accelerator-SW are comparing it to the libraries listed below
- YOLOv5 on Orin DLA ☆191 Updated last year
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson ☆323 Updated 2 years ago
- A simple tool that can generate TensorRT plugin code quickly. ☆228 Updated last year
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration ☆53 Updated 9 months ago
- Collection of blogs on AI development ☆19 Updated 4 months ago
- A parser, editor and profiler tool for ONNX models. ☆421 Updated 2 months ago
- Deep Learning tools and applications for NVIDIA AGX platforms. ☆196 Updated last week
- Common utilities for ONNX converters ☆259 Updated 3 months ago
- TensorRT Plugin Autogen Tool ☆369 Updated last year
- Using Unified Memory on Jetson ☆25 Updated 3 years ago
- A Toolkit to Help Optimize Large ONNX Models ☆153 Updated 10 months ago
- Inference of quantization aware trained networks using TensorRT ☆80 Updated 2 years ago
- Edge AI Software and Development Tools ☆138 Updated 2 months ago
- Using a pattern matcher in ONNX models to match and replace subgraphs. ☆77 Updated last year
- TensorRT 7 C++ (almost) minimal examples ☆80 Updated last year
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms. ☆102 Updated 3 weeks ago
- Code accompanying the Bilibili video https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 ☆66 Updated last year
- Offline Quantization Tools for Deployment. ☆125 Updated last year
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op… ☆290 Updated 10 months ago
- ☆159 Updated last year
- This repository provides a YOLOv5 GPU optimization sample ☆103 Updated 2 years ago
- ☆266 Updated 2 years ago
- This is an 8-bit quantization sample for YOLOv5. PTQ, QAT and Partial Quantization have all been implemented, and present the results based… ☆101 Updated 2 years ago
- Useful TensorRT plugins for PyTorch and mmdetection model conversion. ☆163 Updated 5 months ago
- Experimental projects related to TensorRT ☆94 Updated this week
- An ONNX-based quantization tool. ☆71 Updated last year
- PyTorch emulation library for Microscaling (MX)-compatible data formats ☆209 Updated 5 months ago
- CUDA Matrix Multiplication Optimization ☆173 Updated 8 months ago
- Code reading for TVM ☆75 Updated 3 years ago
- A llama model inference framework implemented in CUDA C++ ☆48 Updated 4 months ago