☆137Apr 9, 2026Updated last week
Alternatives and similar repositories for Awesome-On-Device-AI-Systems
Users that are interested in Awesome-On-Device-AI-Systems are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Let's use Qualcomm NPU in Android☆18Feb 18, 2025Updated last year
- Self-implemented NN operators for Qualcomm's Hexagon NPU☆61Sep 30, 2025Updated 6 months ago
- Miro[ACM MobiCom '23] Cost-effective On-device Continual Learning over Memory Hierarchy with Miro☆16Feb 1, 2024Updated 2 years ago
- Multi-branch model for concurrent execution☆18Jun 27, 2023Updated 2 years ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio…☆49Feb 13, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A template-based, layer-oriented High Level Synthesis Tool for AI algorithms☆14Mar 24, 2026Updated 3 weeks ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Feb 14, 2026Updated 2 months ago
- Fast Multimodal LLM on Mobile Devices☆1,470Apr 12, 2026Updated last week
- ☆15Aug 19, 2024Updated last year
- LLM inference in C/C++☆21Oct 22, 2025Updated 5 months ago
- ☆212Jan 17, 2024Updated 2 years ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 6 months ago
- This repo contains the code developed for my master thesis☆13Mar 24, 2022Updated 4 years ago
- ☆19Feb 28, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Protecting Accelerator Execution with Arm Confidential Computing Architecture (USENIX Security 2024)☆27Dec 11, 2023Updated 2 years ago
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆30Mar 5, 2024Updated 2 years ago
- 🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.☆260Feb 13, 2026Updated 2 months ago
- ☆10Oct 5, 2023Updated 2 years ago
- ☆78May 28, 2023Updated 2 years ago
- ☆14Nov 3, 2025Updated 5 months ago
- The artifact for NDSS '25 paper "ASGARD: Protecting On-Device Deep Neural Networks with Virtualization-Based Trusted Execution Environmen…☆15Oct 16, 2025Updated 6 months ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆33Aug 18, 2023Updated 2 years ago
- 🔥🔥🔥 Latest works on video streaming/processing/analysis☆111Nov 5, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The note of Qualcomm OpenCL SDK☆37Nov 8, 2018Updated 7 years ago
- ☆21Oct 2, 2024Updated last year
- General Purpose Graphics Processing Unit (GPGPU) IP Core☆11Jul 4, 2014Updated 11 years ago
- Latest Advances on Federated LLM Learning☆109Jul 7, 2025Updated 9 months ago
- ☆15Feb 23, 2025Updated last year
- ☆34Nov 7, 2022Updated 3 years ago
- ☆16Mar 9, 2021Updated 5 years ago
- On-Device Learning for Low-Power IoT Devices☆14Aug 12, 2023Updated 2 years ago
- Lightweight Neural Architecture Search for Temporal Convolutional Networks at the Edge☆10Mar 6, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- OpenCCA: An Open Framework to Enable Arm CCA Research☆21Sep 10, 2025Updated 7 months ago
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆29May 10, 2021Updated 4 years ago
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆137Updated this week
- ☆123Sep 22, 2025Updated 6 months ago
- ☆16Jan 5, 2024Updated 2 years ago
- GPUReplay, ASPLOS 2022☆41Feb 21, 2022Updated 4 years ago
- A Distributed Analysis and Benchmarking Framework for Apache OpenWhisk Serverless Platform☆12Dec 11, 2018Updated 7 years ago