[ArXiv 2025] A curated list of papers on on-device large language models, focusing on model compression and system optimization techniques from the survey "On-Device Large Language Models: A Survey of Model Compression and System Optimization".
☆31Apr 29, 2026Updated last week
Alternatives and similar repositories for Awesome-On-Device-LLMs
Users that are interested in Awesome-On-Device-LLMs are comparing it to the libraries listed below.
- ☆10Mar 30, 2026Updated last month
- NJU Operating Systems: Design and Implementation (2022) course notes and code☆10May 16, 2024Updated last year
- Awesome Agent Skills collection list, papers, tools, projects, and resources☆55Feb 16, 2026Updated 2 months ago
- ☆12May 13, 2025Updated 11 months ago
- Dedicated to interdisciplinary integration in AI for science.☆11Aug 18, 2024Updated last year
- ☆13Oct 17, 2024Updated last year
- ☆15Sep 22, 2023Updated 2 years ago
- Source code for my programming assignments in the OS2019 (Operating Systems) course at NJU.☆22Jun 30, 2019Updated 6 years ago
- ☆23May 2, 2026Updated last week
- ☆16Sep 27, 2023Updated 2 years ago
- Python utility to convert PyTorch model weights from '.bin' to '.safetensors' format.☆18Sep 19, 2025Updated 7 months ago
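- A walkthrough of the official transformers source code. In the era of large AI models, pytorch and transformer are the new operating system, and everything else is software running on top of them.☆16Sep 25, 2023Updated 2 years ago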
- airsdk.dev website source - News information and documentation on the AIR SDK.☆60Apr 29, 2026Updated last week
- Bjontegaard metric calculation. Includes BD-PSNR and BD-rate☆13Sep 4, 2024Updated last year
- A real-world multi-view event-based action recognition benchmark released by the paper "Hypergraph-Based Multi-View Action Recognition Us…☆15Sep 19, 2024Updated last year
- Neural Engine, 16 input channels☆16Oct 31, 2022Updated 3 years ago
- ☆19Nov 30, 2025Updated 5 months ago
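- Computer science graduation project: a hybrid news recommendation system using collaborative filtering, built with Python and Vue.js (news website, news publishing system)☆16Dec 27, 2021Updated 4 years ago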
- whatever it means☆16Apr 1, 2026Updated last month
- ☆27Jul 13, 2022Updated 3 years ago
- Chinese translation and tutorials for Streamlit☆17Dec 26, 2021Updated 4 years ago
- ☆42Feb 14, 2026Updated 2 months ago
- Welcome to the official repository of AC-LORA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs, a mechanism that provides tr…☆21Nov 14, 2025Updated 5 months ago
- Bjontegaard metric computation in Python☆17Oct 9, 2020Updated 5 years ago
- LLM inference in C/C++☆21May 1, 2026Updated last week
- TabletopGen: Instance-Level Interactive 3D Tabletop Scene Generation from Text or Single Image☆89Apr 27, 2026Updated last week
- ☆35Feb 14, 2026Updated 2 months ago
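- Python implementation of calls to the 和风天气 (QWeather) API☆19May 8, 2023Updated 3 years ago
- Notes I prepared for final examinations at NJU CS (my final-exam review materials for the Nanjing University computer science department)☆33Aug 22, 2021Updated 4 years ago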
- ☆37Nov 24, 2025Updated 5 months ago
- ☆22May 22, 2024Updated last year
- Code for our NeurIPS'24 paper☆38Oct 28, 2024Updated last year
- Yet another `llama.cpp` Rust wrapper☆12Updated this week
- (TPAMI 2023) TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers (implementations of TransVOD++).☆40Jan 18, 2023Updated 3 years ago
- [Re] Object Detection Meets Knowledge Graphs☆22Feb 3, 2023Updated 3 years ago
- The TikTok Research API Wrapper is a toolkit built in R and Python to facilitate use of the TikTok Research API☆31Dec 4, 2024Updated last year
- Implementation of Latent Replay, a Continual Learning strategy for Real-Time / On The Edge applications☆14May 7, 2020Updated 6 years ago
- ☆29Feb 7, 2022Updated 4 years ago
- Collect your latest articles from sources such as dev.to, and then update the README.md☆11May 2, 2026Updated last week