cap-lab / jedi
Jetson embedded platform-target deep learning inference acceleration framework with TensorRT
☆27Updated last week
Alternatives and similar repositories for jedi:
Users that are interested in jedi are comparing it to the libraries listed below
- Inference of quantization aware trained networks using TensorRT☆80Updated 2 years ago
- ☆141Updated 2 years ago
- A Winograd Minimal Filter Implementation in CUDA☆24Updated 3 years ago
- ☆226Updated 2 years ago
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆54Updated 9 months ago
- llama INT4 cuda inference with AWQ☆53Updated 2 months ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆196Updated 2 years ago
- ☆36Updated 5 months ago
- Manually implemented quantization-aware training☆21Updated 2 years ago
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"☆51Updated 2 months ago
- code reading for tvm☆76Updated 3 years ago
- A set of examples around MegEngine☆31Updated last year
- This is a list of awesome edgeAI inference related papers.☆95Updated last year
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆191Updated 9 months ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆90Updated last month
- Offline Quantization Tools for Deploy.☆126Updated last year
- NART = NART is not A RunTime, a deep learning inference framework.☆38Updated 2 years ago
- PyTorch Quantization Aware Training Example☆132Updated 10 months ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆33Updated 3 years ago
- ☆61Updated 4 months ago
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆212Updated 6 months ago
- Benchmark scripts for TVM☆74Updated 3 years ago
- Based of paper "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"☆63Updated 4 years ago
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson☆324Updated 2 years ago
- Common libraries for PPL projects☆29Updated 3 weeks ago
- ☆202Updated 3 years ago
- play gemm with tvm☆89Updated last year
- ☆19Updated 4 years ago
- ☆87Updated last year
- Tencent Distribution of TVM☆15Updated last year