wangxb96 / Awesome-EdgeAI
Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"
☆100Updated 4 months ago
Alternatives and similar repositories for Awesome-EdgeAI
Users who are interested in Awesome-EdgeAI are comparing it to the repositories listed below
- A curated list of awesome inference deployment frameworks for artificial intelligence (AI) models. OpenVINO, TensorRT, MediaPipe, TensorFlo…☆71Updated last year
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆224Updated last year
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson☆362Updated 3 years ago
- ☆110Updated 2 weeks ago
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆98Updated last year
- YOLOv5 on Orin DLA☆217Updated last year
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆76Updated 2 years ago
- Edge AI Software and Development Tools☆155Updated 3 months ago
- Edgeai TIDL Tools and Examples - This repository contains tools and examples developed for the Deep learning runtime (DLRT) offering provided …☆178Updated 3 weeks ago
- A network slimming-based pruning method for YOLOv8.☆38Updated last year
- This is a list of awesome edgeAI inference related papers.☆98Updated 2 years ago
- On-Device Training Under 256KB Memory [NeurIPS'22]☆508Updated last year
- Flexible DNN inference under changing memory budgets☆58Updated 11 months ago
- Basic quantization methods, including QAT, PTQ, per_channel, per_tensor, dorefa, lsq, adaround, omse, Histogram, bias_correction, etc.☆51Updated 3 years ago
- EQ-Net [ICCV 2023]☆30Updated 2 years ago
- ☆25Updated last year
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms☆30Updated 3 years ago
- ☆70Updated 3 years ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆129Updated 8 months ago
- Quantization Aware Training☆84Updated 2 years ago
- 🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PT…☆432Updated 5 months ago
- YOLOv5 Quantization Aware Training (QAT, qat_torch branch) and Post Training Quantization with ONNX (ptq_onnx branch ptq_onnx.ipynb)☆15Updated 2 years ago
- This code accompanies the video on Bilibili: https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 ☆71Updated 2 years ago
- [DAC 2024] EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive La…☆76Updated last year
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆639Updated last year
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆78Updated 7 months ago
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆37Updated last year
- This is an 8-bit quantization sample for YOLOv5. PTQ, QAT, and Partial Quantization have all been implemented, and present the results based…☆110Updated 3 years ago
- Post-Training Quantization for Vision transformers.☆236Updated 3 years ago
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆430Updated last week