parrotsky / AutoDiCE
Distributed CNN inference at the edge; extends ncnn with CUDA and MPI+OpenMP support.
☆22Updated 5 months ago
Alternatives and similar repositories for AutoDiCE
Users who are interested in AutoDiCE are comparing it to the libraries listed below.
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆37Updated 2 years ago
- This is a list of awesome edgeAI inference related papers.☆98Updated 2 years ago
- PyTorch implementation of TQT.☆21Updated 4 years ago
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters☆88Updated 5 years ago
- ☆78Updated 2 years ago
- [MobiCom 24] Efficient and Adaptive DNN inference under changeable memory budgets☆58Updated last year
- [DAC 2024] EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive La…☆81Updated last year
- Code for the paper "NeuralPower: Predict and deploy energy-efficient convolutional neural networks"☆23Updated 6 years ago
- An out-of-the-box PyTorch scaffold for neural network quantization-aware training (QAT) research. Website: https://github.com/zhutmost/neuralz…☆25Updated 3 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆61Updated 2 years ago
- An implementation of YOLO using the LSQ network quantization method.☆22Updated 3 years ago
- Fast NPU-aware Neural Architecture Search☆22Updated 4 years ago
- MobiSys#114☆23Updated 2 years ago
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms☆30Updated 3 years ago
- This is the open-source version of TinyTS. The code is dirty so far. We may clean the code in the future.☆19Updated 5 months ago
- An external memory allocator example for PyTorch.☆16Updated 5 months ago
- To deploy Transformer models in CV to mobile devices.☆18Updated 4 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Updated 2 years ago
- ☆33Updated 2 years ago
- ☆14Updated 4 years ago
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆28Updated 4 years ago
- About DNN compression and acceleration on Edge Devices.☆57Updated 4 years ago
- play gemm with tvm☆92Updated 2 years ago
- ☆21Updated 4 years ago
- Quantizes PyTorch models; supports post-training quantization and quantization-aware training methods☆14Updated 2 years ago
- Experimental deep learning framework written in Rust☆15Updated 3 years ago
- CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution☆17Updated 2 years ago
- A Winograd Minimal Filter Implementation in CUDA☆28Updated 4 years ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework for DNN-driven Autonomous Systems"☆22Updated 5 years ago
- ☆37Updated 3 years ago