wangxb96 / Awesome-EdgeAI

Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"

☆89

Alternatives and similar repositories for Awesome-EdgeAI

Users that are interested in Awesome-EdgeAI are comparing it to the libraries listed below

NVIDIA / Deep-Learning-Accelerator-SW
NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.
☆209 Updated last year
Yulv-git / Model-Inference-Deployment
A curated list of awesome inference deployment frameworks for artificial intelligence (AI) models. OpenVINO, TensorRT, MediaPipe, TensorFlo…
☆65 Updated last year
liangyn22 / MCUFormer
[NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory
☆70 Updated last year
TexasInstruments / edgeai
Edge AI Software and Development Tools
☆147 Updated 4 months ago
TexasInstruments / edgeai-tidl-tools
Edge AI TIDL Tools and Examples - This repository contains tools and examples developed for the Deep Learning Runtime (DLRT) offering provided …
☆167 Updated last month
jeho-lee / Awesome-On-Device-AI-Systems
☆70 Updated last month
NVIDIA-AI-IOT / jetson_dla_tutorial
A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson
☆339 Updated 3 years ago
ybai789 / yolov8-prune-network-slimming
A network slimming-based pruning method for YOLOv8.
☆35 Updated last year
Kyrie-Zhao / awesome-real-time-AI
A list of awesome papers related to edge AI inference.
☆97 Updated last year
xxxxyu / FlexNN
Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"
☆54 Updated 6 months ago
NVIDIA-AI-IOT / cuDLA-samples
YOLOv5 on Orin DLA
☆207 Updated last year
NVlabs / HALP
☆68 Updated 2 years ago
DD-DuDa / awesome-vit-quantization-acceleration
List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.
☆91 Updated last year
mit-han-lab / tiny-training
On-Device Training Under 256KB Memory [NeurIPS'22]
☆483 Updated last year
mit-han-lab / mcunet
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…
☆589 Updated last year
Qualcomm-AI-research / pruning-vs-quantization
☆22 Updated last year
hrcheng1066 / awesome-pruning
☆266 Updated 11 months ago
chengtao-lv / PTQ4SAM
[CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything
☆80 Updated last year
UoS-EEC / DynamicOFA
[CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms
☆29 Updated 2 years ago
coderonion / awesome-cuda-and-hpc
🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PT…
☆305 Updated last week
hahnyuan / PTQ4ViT
Post-Training Quantization for Vision Transformers.
☆223 Updated 3 years ago
leimao / PyTorch-Quantization-Aware-Training
PyTorch Quantization Aware Training Example
☆138 Updated last year
xuke225 / EQ-Net
EQ-Net [ICCV 2023]
☆30 Updated last year
SteveTsui / Q-DETR
☆35 Updated last year
liuzechun / Nonuniform-to-Uniform-Quantization
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.
☆133 Updated 3 years ago
Forggtensky / Quantize_Pytorch_Vgg16AndMobileNet
Quantize, PyTorch, VGG16, MobileNet
☆42 Updated 4 years ago
maggiez0138 / yolov5_quant_sample
This is an 8-bit quantization sample for YOLOv5. PTQ, QAT, and partial quantization have all been implemented, and present the results based…
☆105 Updated 3 years ago
Qualcomm-AI-research / transformer-quantization
☆205 Updated 3 years ago
thb1314 / mmyolo_tensorrt
☆147 Updated last year
levipereira / yolov9-qat
Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.
☆119 Updated 3 months ago