cap-lab / jedi
Deep learning inference acceleration framework targeting Jetson embedded platforms, built on TensorRT
☆26 Updated this week
Alternatives and similar repositories for jedi:
Users who are interested in jedi are comparing it to the libraries listed below:
- Inference of quantization aware trained networks using TensorRT ☆80 Updated 2 years ago
- Count number of parameters / MACs / FLOPS for ONNX models. ☆89 Updated 4 months ago
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson ☆323 Updated 2 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration ☆197 Updated 2 years ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications. ☆189 Updated 9 months ago
- YOLOv5 on Orin DLA ☆190 Updated last year
- A set of examples around MegEngine ☆31 Updated last year
- ☆141 Updated 2 years ago
- A Winograd Minimal Filter Implementation in CUDA ☆24 Updated 3 years ago
- Based on the paper "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference" ☆62 Updated 4 years ago
- PyTorch Quantization Aware Training Example ☆130 Updated 9 months ago
- Offline Quantization Tools for Deployment. ☆124 Updated last year
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration ☆53 Updated 8 months ago
- ☆35 Updated 5 months ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms. ☆101 Updated 2 weeks ago
- PyTorch Static Quantization Example ☆38 Updated 3 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization ☆94 Updated 2 years ago
- Common libraries for PPL projects ☆29 Updated this week
- ☆44 Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming ☆96 Updated 3 years ago
- This is an 8-bit quantization sample for yolov5. PTQ, QAT and Partial Quantization have all been implemented, and present the results based… ☆101 Updated 2 years ago
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms ☆29 Updated 2 years ago
- Quantization of Convolutional Neural networks. ☆244 Updated 7 months ago
- Post-Training Quantization for Vision transformers. ☆206 Updated 2 years ago
- ☆17 Updated 4 years ago
- PyTorch implementation of BRECQ, ICLR 2021 ☆268 Updated 3 years ago
- code reading for tvm ☆74 Updated 3 years ago
- ☆34 Updated last year
- VeriSilicon Tensor Interface Module ☆230 Updated 2 months ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM. ☆427 Updated last year