cap-lab / jedi
Jetson embedded platform-target deep learning inference acceleration framework with TensorRT
☆28Updated last month
Alternatives and similar repositories for jedi:
Users that are interested in jedi are comparing it to the libraries listed below
- Inference of quantization aware trained networks using TensorRT☆80Updated 2 years ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆193Updated 10 months ago
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson☆326Updated 2 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆197Updated 2 years ago
- ☆143Updated 2 years ago
- ☆36Updated 6 months ago
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"☆53Updated 3 months ago
- To deploy Transformer models in CV to mobile devices.☆18Updated 3 years ago
- A Winograd Minimal Filter Implementation in CUDA☆24Updated 3 years ago
- A set of examples around MegEngine☆31Updated last year
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆84Updated 10 months ago
- play gemm with tvm☆90Updated last year
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆57Updated 10 months ago
- Benchmark scripts for TVM☆74Updated 3 years ago
- ☆66Updated 2 years ago
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆217Updated last week
- Offline Quantization Tools for Deploy.☆127Updated last year
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆91Updated 3 weeks ago
- YOLOv5 on Orin DLA☆198Updated last year
- PyTorch Quantization Aware Training Example☆135Updated 11 months ago
- Tencent Distribution of TVM☆15Updated 2 years ago
- ☆69Updated 2 years ago
- code reading for tvm☆76Updated 3 years ago
- [CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric☆55Updated 2 years ago
- Tutorials of Extending and importing TVM with CMAKE Include dependency.☆13Updated 6 months ago
- This is a list of awesome edgeAI inference related papers.☆95Updated last year
- LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks☆13Updated 3 years ago
- Collection of blogs on AI development☆19Updated 5 months ago
- distributed CNN inference at the edge, extend ncnn with CUDA, MPI+OPENMP support.☆18Updated last year
- ☆58Updated 5 months ago