The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]
☆19Aug 4, 2022Updated 3 years ago
Alternatives and similar repositories for Mandheling-DSP-Training
Users that are interested in Mandheling-DSP-Training are comparing it to the libraries listed below
Sorting:
- Our unique contributions are in tools/train/benchmark.☆21Apr 14, 2025Updated 10 months ago
- Artifacts of EVT ASPLOS'24☆29Mar 6, 2024Updated last year
- ☆35May 28, 2024Updated last year
- A demo of end-to-end federated learning system.☆69Jun 1, 2022Updated 3 years ago
- Federated Few-shot Learning for Mobile NLP. Conditionally accepted by MobiCom'23.☆16Aug 18, 2023Updated 2 years ago
- Self-implemented NN operators for Qualcomm's Hexagon NPU☆49Sep 30, 2025Updated 5 months ago
- Qualcomm Hexagon NN Offload Framework☆45Nov 5, 2020Updated 5 years ago
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆116Updated this week
- ☆48Jun 2, 2022Updated 3 years ago
- ☆212Jan 17, 2024Updated 2 years ago
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆30Mar 5, 2024Updated 2 years ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆34Aug 18, 2023Updated 2 years ago
- ☆21Nov 12, 2025Updated 3 months ago
- A profiler to disclose and quantify hardware features on GPUs.☆176May 15, 2022Updated 3 years ago
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning TrainingFramework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- ☆12Apr 30, 2024Updated last year
- Shadowsocks/ShadowsocksR 账号在线监控☆12Nov 25, 2018Updated 7 years ago
- ☆13Updated this week
- ☆12Mar 1, 2025Updated last year
- ☆12Mar 18, 2024Updated last year
- The implementation for maximum clique enumeration algorithm☆11Apr 14, 2016Updated 9 years ago
- Efficient-Tensor-Management-on-HM-for-Deep-Learning☆10Nov 15, 2021Updated 4 years ago
- ☆11Sep 14, 2020Updated 5 years ago
- ☆12Feb 26, 2026Updated last week
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE☆17Aug 5, 2022Updated 3 years ago
- 华为集合通信性能测试☆15May 27, 2024Updated last year
- ☆102Jan 17, 2024Updated 2 years ago
- Securing Deep Spiking Neural Networks against Adversarial Attacks through Inherent Structural Parameters☆13Aug 15, 2022Updated 3 years ago
- Benchmark and resources for single super-resolution algorithms☆10Apr 14, 2017Updated 8 years ago
- A test case for evaluating the performance of the workgroup reduction operation in OpenCL 2.0☆10Nov 26, 2020Updated 5 years ago
- 这里收录比较实用的计算机相关技术书籍,可以在短期之内入门的简单实用教程、一些技术网站以及一些写的比较好的博文,欢迎Fork,你也可以通过Pull Request参与编辑。☆10Jul 21, 2016Updated 9 years ago
- Unsupervised anomaly detection in the latent space of high energy physics events with quantum machine learning.☆20Oct 29, 2024Updated last year
- FPGA 2025 SAT Accel: A modern SAT Solver on FPGA Repository☆14Mar 13, 2025Updated 11 months ago
- Synthetic aperture focusing technique for optoacoustic mesoscopy and scanning acoustic microscopy.☆13Jul 24, 2024Updated last year
- Repository for AI model benchmarking on TT-Buda☆15Feb 9, 2026Updated 3 weeks ago
- GoogleNet-BN,namely InceptionNetV2 based on pytorch.☆14Apr 27, 2018Updated 7 years ago
- ☆13May 11, 2023Updated 2 years ago
- This is a personal archive. Please refer to github.com/UCLA-VAST/RapidStream☆15May 31, 2022Updated 3 years ago