parrotsky / AutoDiCE
distributed CNN inference at the edge, extend ncnn with CUDA, MPI+OPENMP support.
☆18Updated last year
Related projects ⓘ
Alternatives and complementary repositories for AutoDiCE
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆27Updated 9 months ago
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters☆80Updated 4 years ago
- This is a list of awesome edgeAI inference related papers.☆88Updated 10 months ago
- MobiSys#114☆21Updated last year
- InFi is a library for building input filters for resource-efficient inference.☆37Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆19Updated last year
- ☆94Updated 9 months ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆10Updated last year
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆19Updated 3 years ago
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms☆29Updated 2 years ago
- Experimental deep learning framework written in Rust☆13Updated 2 years ago
- Multi-branch model for concurrent execution☆16Updated last year
- To deploy Transformer models in CV to mobile devices.☆18Updated 2 years ago
- Create tiny ML systems for on-device learning.☆20Updated 3 years ago
- hands on model tuning with TVM and profile it on a Mac M1, x86 CPU, and GTX-1080 GPU.☆39Updated last year
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency☆25Updated 3 years ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆18Updated 2 years ago
- ☆38Updated 4 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆33Updated last year
- 云边协同- collaborative inference📚Dynamic adaptive DNN surgery for inference acceleration on the edge☆30Updated last year
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…☆23Updated last year
- Autodidactic Neurosurgeon Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning☆37Updated 3 years ago
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…☆24Updated 2 years ago
- Penn CIS 5650 (GPU Programming and Architecture) Final Project☆24Updated 10 months ago
- ☆14Updated 2 years ago
- An external memory allocator example for PyTorch.☆13Updated 3 years ago
- ☆20Updated last year
- Simple PyTorch graph capturing.☆13Updated last year
- Tutorials of Extending and importing TVM with CMAKE Include dependency.☆10Updated 3 weeks ago
- PyTorch implementation of the paper: Decomposing Vision Transformers for Collaborative Inference in Edge Devices☆10Updated 3 months ago