☆162 Jun 21, 2026 Updated last week
Alternatives and similar repositories for Awesome-On-Device-AI-Systems
Users that are interested in Awesome-On-Device-AI-Systems are comparing it to the libraries listed below.
- Self-implemented NN operators for Qualcomm's Hexagon NPU ☆73 Sep 30, 2025 Updated 8 months ago
- Multi-branch model for concurrent execution ☆18 Jun 27, 2023 Updated 3 years ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio… ☆52 Apr 29, 2026 Updated last month
- Codebase for "MELTing Point: Mobile Evaluation of Language Transformers" ☆21 Jul 19, 2024 Updated last year
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK ☆94 Jun 8, 2026 Updated 3 weeks ago
- Fast Multimodal LLM on Mobile Devices ☆1,552 Jun 9, 2026 Updated 2 weeks ago
- ☆16 Aug 19, 2024 Updated last year
- ☆29 Feb 3, 2026 Updated 4 months ago
- [ICLR 2021] CompOFA: Compound Once-For-All Networks For Faster Multi-Platform Deployment ☆25 Jan 5, 2023 Updated 3 years ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality ☆18 Oct 8, 2023 Updated 2 years ago
- ☆215 Jan 17, 2024 Updated 2 years ago
- Official repository Flash Local Linear Attention ☆37 May 28, 2026 Updated last month
- ☆103 Jan 17, 2024 Updated 2 years ago
- ☆19 Feb 28, 2022 Updated 4 years ago
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c… ☆30 Mar 5, 2024 Updated 2 years ago
- ☆10 Oct 5, 2023 Updated 2 years ago
- 🤖FFPA: Extends FlashAttention-2 via Split-D for large headdims, 1.5x~3×↑🎉 vs SDPA, up to 430T🎉 on H200. ☆310 Jun 22, 2026 Updated last week
- ☆78 May 28, 2023 Updated 3 years ago
- ☆14 Nov 3, 2025 Updated 7 months ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023. ☆34 Aug 18, 2023 Updated 2 years ago
- ☆14 Apr 27, 2022 Updated 4 years ago
- Latest Advances on Federated LLM Learning ☆111 Jul 7, 2025 Updated 11 months ago
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch ☆15 Feb 13, 2022 Updated 4 years ago
- ☆22 Jul 16, 2022 Updated 3 years ago
- ☆14 Feb 23, 2025 Updated last year
- An easily extensible framework for understanding and optimizing CUDA operators, intended for learning use only ☆18 Jun 13, 2024 Updated 2 years ago
- ☆34 Nov 7, 2022 Updated 3 years ago
- ☆16 Mar 9, 2021 Updated 5 years ago
- OpenCCA: An Open Framework to Enable Arm CCA Research ☆22 Sep 10, 2025 Updated 9 months ago
- ☆28 Mar 14, 2024 Updated 2 years ago
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation ☆31 May 10, 2021 Updated 5 years ago
- Code for "Multilingual language models predict human reading behavior" ☆12 Oct 9, 2022 Updated 3 years ago
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai ☆162 Updated this week
- ☆130 Sep 22, 2025 Updated 9 months ago
- A Distributed Analysis and Benchmarking Framework for Apache OpenWhisk Serverless Platform ☆12 Dec 11, 2018 Updated 7 years ago
- DLBlas: clean and efficient kernels ☆41 Updated this week
- A standalone CXL-enabled system simulator. ☆21 Apr 19, 2026 Updated 2 months ago
- Vivado in GitLab-Runner for GitLab CI/CD ☆10 Oct 27, 2022 Updated 3 years ago
- learning how CUDA works ☆392 Mar 3, 2025 Updated last year