mrsnu / band
Multi-DNN Inference Engine for Heterogeneous Mobile Processors
☆23Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for band
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆20Updated 3 years ago
- ☆74Updated last year
- LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks☆11Updated 2 years ago
- MobiSys#114☆21Updated last year
- ☆188Updated 9 months ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆55Updated 2 weeks ago
- Multi-branch model for concurrent execution☆16Updated last year
- This is a list of awesome edgeAI inference related papers.☆88Updated 10 months ago
- Experimental deep learning framework written in Rust☆13Updated 2 years ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆19Updated 3 years ago
- Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)☆53Updated 7 months ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality☆12Updated last year
- ☆41Updated last year
- ☆37Updated 3 years ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio…☆25Updated last year
- ☆101Updated last year
- Cache design for CNN on mobile☆32Updated 6 years ago
- Study Group of Deep Learning Compiler☆152Updated last year
- ☆22Updated last year
- [MobiSys 2020] Fast and Scalable In-memory Deep Multitask Learning via Neural Weight Virtualization☆15Updated 4 years ago
- ☆18Updated 2 years ago
- NeuPIMs Simulator☆51Updated 4 months ago
- LLM serving cluster simulator☆74Updated 6 months ago
- ☆16Updated last year
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆22Updated last year
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency☆25Updated 3 years ago
- Multi-Instance-GPU profiling tool☆53Updated last year
- ☆45Updated last year
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters☆80Updated 4 years ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆54Updated 2 months ago