wangxb96 / Awesome-EdgeAI
Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"
☆87 Updated 5 months ago
Alternatives and similar repositories for Awesome-EdgeAI
Users that are interested in Awesome-EdgeAI are comparing it to the libraries listed below
- Edge AI Software and Development Tools ☆145 Updated 2 months ago
- A curated list of awesome inference deployment frameworks for artificial intelligence (AI) models. OpenVINO, TensorRT, MediaPipe, TensorFlo… ☆62 Updated last year
- A list of awesome Edge AI inference-related papers. ☆95 Updated last year
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory ☆68 Updated last year
- List of papers related to Vision Transformer quantization and hardware acceleration in recent AI conferences and journals. ☆91 Updated last year
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications. ☆200 Updated last year
- ☆66 Updated 2 years ago
- ☆202 Updated last year
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms ☆29 Updated 2 years ago
- ☆51 Updated 2 weeks ago
- Edgeai TIDL Tools and Examples - This repository contains tools and examples developed for the Deep Learning Runtime (DLRT) offering provided … ☆164 Updated last month
- Distributed CNN inference at the edge; extends ncnn with CUDA and MPI+OpenMP support. ☆18 Updated 2 years ago
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices" ☆54 Updated 5 months ago
- ☆22 Updated last year
- Quantization Aware Training ☆75 Updated last year
- YOLOv5 on Orin DLA ☆204 Updated last year
- Deep learning inference acceleration framework with TensorRT, targeting the Jetson embedded platform ☆28 Updated 3 weeks ago
- ☆34 Updated last year
- A summary of ONNX tutorials with Python implementations, collected from other web resources. ☆28 Updated 2 years ago
- A fork of the BEVDet series. ☆20 Updated last year
- A network slimming-based pruning method for YOLOv8. ☆34 Updated last year
- [ICCAD'22 TinyML Contest] Efficient Heart Stroke Detection on Low-cost Microcontrollers ☆14 Updated 2 years ago
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices ☆35 Updated last year
- Deploying Transformer models for computer vision on mobile devices. ☆18 Updated 3 years ago
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation ☆25 Updated 4 years ago
- TensorRT deployment and PTQ/QAT tools for FastBEV; total time is only 6.9 ms. ☆269 Updated last year
- 🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PT… ☆285 Updated 3 weeks ago
- ☆13 Updated last year
- ☆25 Updated 3 years ago
- Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022. ☆132 Updated 3 years ago