wangxb96 / Awesome-EdgeAI
Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"
☆87 Updated 6 months ago
Alternatives and similar repositories for Awesome-EdgeAI
Users who are interested in Awesome-EdgeAI are comparing it to the repositories listed below
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications. ☆204 Updated last year
- A curated list of awesome inference deployment frameworks for artificial intelligence (AI) models. OpenVINO, TensorRT, MediaPipe, TensorFlo… ☆64 Updated last year
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms ☆29 Updated 2 years ago
- ☆22 Updated last year
- This is a list of awesome edgeAI inference related papers. ☆96 Updated last year
- Edgeai TIDL Tools and Examples - This repository contains Tools and example developed for Deep learning runtime (DLRT) offering provided … ☆166 Updated 2 weeks ago
- A network slimming-based pruning method for YOLOv8. ☆35 Updated last year
- ☆61 Updated last month
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory ☆69 Updated last year
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals. ☆92 Updated last year
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices" ☆55 Updated 5 months ago
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson ☆337 Updated 3 years ago
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L… ☆584 Updated last year
- EQ-Net [ICCV 2023] ☆29 Updated last year
- About DNN compression and acceleration on Edge Devices. ☆55 Updated 4 years ago
- Edge AI Software and Development Tools ☆147 Updated 3 months ago
- ☆66 Updated 2 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance" ☆56 Updated 2 years ago
- YOLOv5 on Orin DLA ☆205 Updated last year
- ☆206 Updated 3 years ago
- Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022. ☆133 Updated 3 years ago
- ☆263 Updated 10 months ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms. ☆116 Updated 2 months ago
- On-Device Training Under 256KB Memory [NeurIPS'22] ☆483 Updated last year
- Distributed CNN inference at the edge; extends ncnn with CUDA and MPI+OpenMP support. ☆19 Updated 2 years ago
- [ICCAD'22 TinyML Contest] Efficient Heart Stroke Detection on Low-cost Microcontrollers ☆14 Updated 2 years ago
- 🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PT… ☆292 Updated last week
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th… ☆406 Updated last week
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar… ☆56 Updated last year
- Implementation for the paper 'YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs' ☆98 Updated last year