cap-lab / jediLinks
Jetson embedded platform-target deep learning inference acceleration framework with TensorRT
☆28Updated this week
Alternatives and similar repositories for jedi
Users that are interested in jedi are comparing it to the libraries listed below
Sorting:
- Inference of quantization aware trained networks using TensorRT☆81Updated 2 years ago
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"☆53Updated 4 months ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆199Updated 11 months ago
- Tencent Distribution of TVM☆15Updated 2 years ago
- A Winograd Minimal Filter Implementation in CUDA☆24Updated 3 years ago
- ☆149Updated 2 years ago
- play gemm with tvm☆91Updated last year
- A set of examples around MegEngine☆31Updated last year
- code reading for tvm☆76Updated 3 years ago
- ☆36Updated 7 months ago
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆60Updated last week
- ☆18Updated 2 weeks ago
- ☆17Updated 4 years ago
- llama INT4 cuda inference with AWQ☆54Updated 4 months ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆200Updated 3 years ago
- Offline Quantization Tools for Deploy.☆128Updated last year
- YOLOv5 on Orin DLA☆203Updated last year
- ☆30Updated 2 years ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆92Updated last week
- ☆21Updated 4 years ago
- CUDA project for uni subject☆23Updated 4 years ago
- ☆58Updated 6 months ago
- PyTorch Quantization Aware Training Example☆136Updated last year
- This is 8-bit quantization sample for yolov5. Both PTQ, QAT and Partial Quantization have been implemented, and present the results based…☆102Updated 2 years ago
- Common libraries for PPL projects☆29Updated 2 months ago
- Based of paper "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"☆64Updated 4 years ago
- Count number of parameters / MACs / FLOPS for ONNX models.☆92Updated 7 months ago
- ☆69Updated 2 years ago
- This is a list of awesome edgeAI inference related papers.☆96Updated last year
- Benchmark scripts for TVM☆74Updated 3 years ago