parrotsky / AutoDiCE
Distributed CNN inference at the edge; extends ncnn with CUDA and MPI+OpenMP support.
☆19 · Updated last year
Alternatives and similar repositories for AutoDiCE:
Users that are interested in AutoDiCE are comparing it to the libraries listed below
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices ☆27 · Updated 9 months ago
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters ☆81 · Updated 4 years ago
- This is a list of awesome edgeAI inference related papers. ☆88 · Updated 11 months ago
- MobiSys#114 ☆21 · Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo ☆19 · Updated last year
- InFi is a library for building input filters for resource-efficient inference. ☆37 · Updated last year
- To deploy Transformer models in CV to mobile devices. ☆18 · Updated 2 years ago
- Create tiny ML systems for on-device learning. ☆20 · Updated 3 years ago
- Simple PyTorch graph capturing. ☆14 · Updated last year
- Multi-branch model for concurrent execution ☆16 · Updated last year
- Measuring and predicting on-device metrics (latency, power, etc.) of machine learning models ☆66 · Updated last year
- Experimental deep learning framework written in Rust ☆14 · Updated 2 years ago
- ☆74 · Updated last year
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework for DNN-driven Autonomous Systems" ☆19 · Updated 3 years ago
- ☆24 · Updated 2 years ago
- An external memory allocator example for PyTorch. ☆13 · Updated 3 years ago
- ☆18 · Updated 8 months ago
- Open-source artifacts and codes of our MICRO'23 paper titled “Sparse-DySta: Sparsity-Aware Dynamic and Static Scheduling for Sparse Multi… ☆32 · Updated last year
- ☆18 · Updated 2 years ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms ☆10 · Updated last year
- Official repository for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices". ☆19 · Updated 9 months ago
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar… ☆54 · Updated 8 months ago
- Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arXiv ☆77 · Updated 2 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup ☆34 · Updated last year
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency ☆25 · Updated 3 years ago
- ☆38 · Updated 4 years ago
- A curated list of early exiting (LLM, CV, NLP, etc) ☆29 · Updated 3 months ago
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms ☆29 · Updated 2 years ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio… ☆26 · Updated last year
- ☆14 · Updated 3 months ago