mrsnu / band
Multi-DNN Inference Engine for Heterogeneous Mobile Processors
☆25 Updated 3 months ago
Related projects
Alternatives and complementary repositories for band
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation ☆20 Updated 3 years ago
- LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks ☆11 Updated 2 years ago
- ☆188 Updated 10 months ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale ☆58 Updated 3 weeks ago
- ☆74 Updated last year
- Multi-branch model for concurrent execution ☆16 Updated last year
- MobiSys#114 ☆21 Updated last year
- This is a list of awesome edgeAI inference related papers. ☆88 Updated 11 months ago
- Experimental deep learning framework written in Rust ☆14 Updated 2 years ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality ☆12 Updated last year
- NeuPIMs Simulator ☆54 Updated 5 months ago
- A version of XRBench-MAESTRO used for MLSys 2023 publication ☆22 Updated last year
- ☆41 Updated last year
- ☆39 Updated last month
- ☆16 Updated last year
- LLM serving cluster simulator ☆81 Updated 6 months ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework for DNN-driven Autonomous Systems" ☆19 Updated 3 years ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio… ☆26 Updated last year
- Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23) ☆54 Updated 7 months ago
- ☆37 Updated 3 years ago
- OwLite Examples repository offers illustrative example codes to help users seamlessly compress PyTorch deep learning models and transform… ☆9 Updated last month
- ☆100 Updated last year
- ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference ☆67 Updated last week
- ☆23 Updated last year
- ☆45 Updated last year
- ☆48 Updated last year
- This repository is established to store personal notes and annotated papers during daily research. ☆90 Updated this week
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling ☆8 Updated 8 months ago
- [HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System ☆36 Updated 8 months ago
- ☆100 Updated last month