☆78May 28, 2023Updated 2 years ago
Alternatives and similar repositories for CoDL
Users that are interested in CoDL are comparing it to the libraries listed below
Sorting:
- Multi-branch model for concurrent execution☆18Jun 27, 2023Updated 2 years ago
- Multi-DNN Inference Engine for Heterogeneous Mobile Processors☆38Jul 24, 2024Updated last year
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- ☆212Jan 17, 2024Updated 2 years ago
- image to column☆30Jul 15, 2014Updated 11 years ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- ☆19Feb 28, 2022Updated 4 years ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆58Aug 21, 2024Updated last year
- ☆22Feb 18, 2025Updated last year
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 4 years ago
- The note of Qualcomm OpenCL SDK☆37Nov 8, 2018Updated 7 years ago
- MLPerf Mobile benchmarks☆15Jan 27, 2026Updated last month
- ☆16Oct 3, 2023Updated 2 years ago
- ☆18Oct 15, 2020Updated 5 years ago
- LLM4HWDesign Starting Toolkit☆19Oct 4, 2024Updated last year
- SOTA Learning-augmented Systems☆37May 21, 2022Updated 3 years ago
- [MobiSys 2020] Fast and Scalable In-memory Deep Multitask Learning via Neural Weight Virtualization☆15Jun 9, 2020Updated 5 years ago
- LLM inference in C/C++☆20Oct 22, 2025Updated 4 months ago
- ☆15Jul 25, 2023Updated 2 years ago
- ☆23Feb 18, 2022Updated 4 years ago
- Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents☆27Dec 6, 2024Updated last year
- A profiler to disclose and quantify hardware features on GPUs.☆176May 15, 2022Updated 3 years ago
- LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks☆17Mar 25, 2022Updated 3 years ago
- The source code of INFless,a native serverless platform for AI inference.☆46Oct 10, 2022Updated 3 years ago
- ☆126Feb 12, 2026Updated 3 weeks ago
- [DATE'23] The official code for paper <CLAP: Locality Aware and Parallel Triangle Counting with Content Addressable Memory>☆23Jan 19, 2026Updated last month
- Artifacts for the "Caching with Delayed Hits" paper that appears in SIGCOMM '20.☆21Feb 26, 2021Updated 5 years ago
- ☆18Oct 31, 2022Updated 3 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆363Jul 30, 2024Updated last year
- CSR-based SpGEMM on nVidia and AMD GPUs☆47Apr 9, 2016Updated 9 years ago
- Fast Multimodal LLM on Mobile Devices☆1,412Updated this week
- SocksDirect code repository☆19Jun 26, 2022Updated 3 years ago
- ngAP's artifact for ASPLOS'24☆25Jul 29, 2025Updated 7 months ago
- End to End steps for adding custom ops in PyTorch.☆24Aug 20, 2020Updated 5 years ago
- To deploy Transformer models in CV to mobile devices.☆18Jan 20, 2022Updated 4 years ago
- Manually implemented quantization-aware training☆23Oct 12, 2022Updated 3 years ago
- MobiSys#114☆23Aug 17, 2023Updated 2 years ago
- ☆57Dec 8, 2021Updated 4 years ago
- ☆102Jan 17, 2024Updated 2 years ago