NVIDIA / Deep-Learning-Accelerator-SW
NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.
☆200 Updated last year
Alternatives and similar repositories for Deep-Learning-Accelerator-SW
Users that are interested in Deep-Learning-Accelerator-SW are comparing it to the libraries listed below
- YOLOv5 on Orin DLA ☆204 Updated last year
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson ☆332 Updated 3 years ago
- A simple tool that can generate TensorRT plugin code quickly. ☆232 Updated last year
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration ☆61 Updated last month
- Deep Learning tools and applications for NVIDIA AGX platforms. ☆226 Updated this week
- TensorRT Plugin Autogen Tool ☆369 Updated 2 years ago
- A parser, editor and profiler tool for ONNX models. ☆442 Updated 2 weeks ago
- BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8). ☆493 Updated last year
- Collection of blogs on AI development ☆19 Updated 7 months ago
- Inference of quantization aware trained networks using TensorRT ☆82 Updated 2 years ago
- Offline Quantization Tools for Deploy. ☆129 Updated last year
- Using pattern matcher in onnx model to match and replace subgraphs. ☆80 Updated last year
- A large collection of examples for learning CUDA and TensorRT. ☆135 Updated 2 years ago
- A tutorial for CUDA & PyTorch ☆146 Updated 5 months ago
- This code accompanies the Bilibili video at https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 ☆69 Updated last year
- TensorRT 7 C++ (almost) minimal examples ☆81 Updated last year
- Experimental projects related to TensorRT ☆105 Updated last week
- Edge AI Software and Development Tools ☆145 Updated 2 months ago
- code reading for tvm ☆76 Updated 3 years ago
- This project aims to explore the deployment of Swin-Transformer based on TensorRT, including the test results of FP16 and INT8. ☆167 Updated 2 years ago
- Useful tensorrt plugin. For pytorch and mmdetection model conversion. ☆164 Updated 8 months ago
- This is an 8-bit quantization sample for yolov5. PTQ, QAT and Partial Quantization have all been implemented, and present the results based… ☆102 Updated 2 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency ☆110 Updated 9 months ago
- CUDA Matrix Multiplication Optimization ☆196 Updated 11 months ago
- Using Unified Memory on Jetson ☆27 Updated 3 years ago