wangxb96 / Awesome-EdgeAILinks
Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"
☆98Updated 2 months ago
Alternatives and similar repositories for Awesome-EdgeAI
Users that are interested in Awesome-EdgeAI are comparing it to the libraries listed below
Sorting:
- A curated list of awesome inference deployment framework of artificial intelligence (AI) models. OpenVINO, TensorRT, MediaPipe, TensorFlo…☆70Updated last year
- ☆92Updated last month
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆219Updated last year
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆74Updated 2 years ago
- A network slimming-based pruning method for YOLOv8.☆36Updated last year
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆96Updated last year
- On-Device Training Under 256KB Memory [NeurIPS'22]☆497Updated last year
- ☆69Updated 3 years ago
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms☆30Updated 3 years ago
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆619Updated last year
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson☆356Updated 3 years ago
- This is a list of awesome edgeAI inference related papers.☆98Updated last year
- ☆25Updated last year
- 🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PT…☆402Updated 3 months ago
- YOLOv5 on Orin DLA☆216Updated last year
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆425Updated this week
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"☆56Updated 9 months ago
- EQ-Net [ICCV 2023]☆30Updated 2 years ago
- Edgeai TIDL Tools and Examples - This repository contains Tools and example developed for Deep learning runtime (DLRT) offering provided …☆174Updated 2 weeks ago
- PyTorch implementation of the paper: Decomposing Vision Transformers for Collaborative Inference in Edge Devices☆15Updated last year
- This is 8-bit quantization sample for yolov5. Both PTQ, QAT and Partial Quantization have been implemented, and present the results based…☆109Updated 3 years ago
- RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration☆25Updated 5 months ago
- Edge AI Software and Development Tools☆150Updated last month
- Post-Training Quantization for Vision transformers.☆233Updated 3 years ago
- ☆210Updated last year
- Low Precision(quantized) Yolov5☆44Updated 7 months ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆129Updated 6 months ago
- ☆25Updated 3 years ago
- μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.☆82Updated 4 years ago
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆82Updated last year