mrsnu / band
Multi-DNN Inference Engine for Heterogeneous Mobile Processors
☆31Updated 8 months ago
Alternatives and similar repositories for band:
Users that are interested in band are comparing it to the libraries listed below
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆24Updated 3 years ago
- LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks☆14Updated 3 years ago
- ☆77Updated last year
- Multi-branch model for concurrent execution☆17Updated last year
- This is a list of awesome edgeAI inference related papers.☆95Updated last year
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆103Updated last month
- ☆199Updated last year
- MobiSys#114☆21Updated last year
- [ACM EuroSys '23] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access☆56Updated last year
- ☆47Updated 3 months ago
- ☆16Updated last year
- ☆29Updated this week
- ☆37Updated 3 years ago
- LLM serving cluster simulator☆96Updated 11 months ago
- ☆14Updated 3 years ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆22Updated 4 years ago