cap-lab / jediLinks
Jetson embedded platform-target deep learning inference acceleration framework with TensorRT
☆29Updated last month
Alternatives and similar repositories for jedi
Users that are interested in jedi are comparing it to the libraries listed below
Sorting:
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆218Updated last year
- Inference of quantization aware trained networks using TensorRT☆83Updated 2 years ago
- ☆161Updated 2 years ago
- Count number of parameters / MACs / FLOPS for ONNX models.☆94Updated 11 months ago
- A set of examples around MegEngine☆31Updated last year
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"☆55Updated 8 months ago
- ☆241Updated 2 years ago
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms☆30Updated 2 years ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆35Updated 3 years ago
- Offline Quantization Tools for Deploy.☆138Updated last year
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson☆349Updated 3 years ago
- ☆36Updated 2 years ago
- ☆44Updated 4 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆200Updated 3 years ago
- PyTorch Quantization Aware Training Example☆140Updated last year
- Fast CUDA Kernels for ResNet Inference.☆180Updated 6 years ago
- llama INT4 cuda inference with AWQ☆55Updated 8 months ago
- A Winograd Minimal Filter Implementation in CUDA☆28Updated 4 years ago
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆418Updated last week
- [CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric☆58Updated 2 years ago
- tophub autotvm log collections☆69Updated 2 years ago
- ☆11Updated 8 months ago
- This is a list of awesome edgeAI inference related papers.☆98Updated last year
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆161Updated this week
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆262Updated 2 years ago
- Post-Training Quantization for Vision transformers.☆227Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆99Updated 4 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆450Updated 2 years ago
- EQ-Net [ICCV 2023]☆30Updated 2 years ago
- ☆98Updated 4 years ago