[ArXiv 2025] A curated list of papers on on-device large language models, focusing on model compression and system optimization techniques from the survey "On-Device Large Language Models: A Survey of Model Compression and System Optimization".
☆30Jan 27, 2026Updated 2 months ago
Alternatives and similar repositories for Awesome-On-Device-LLMs
Users that are interested in Awesome-On-Device-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Mar 14, 2025Updated last year
- ☆12May 13, 2025Updated 10 months ago
- 致力于AI for science的交叉学科融合。☆11Aug 18, 2024Updated last year
- ☆13Oct 17, 2024Updated last year
- ☆16Feb 7, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆15Sep 22, 2023Updated 2 years ago
- ☆16Sep 27, 2023Updated 2 years ago
- Python utility to convert PyTorch model weights from '.bin' to '.safetensors' format.☆18Sep 19, 2025Updated 6 months ago
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆16Sep 25, 2023Updated 2 years ago
- airsdk.dev website source - News information and documentation on the AIR SDK.☆59Updated this week
- A real-world multi-view event-based action recognition benchmark released by the paper "Hypergraph-Based Multi-View Action Recognition Us…☆16Sep 19, 2024Updated last year
- Bjontegaard metric calculation. Include BD-PSNR and BD-rate☆13Sep 4, 2024Updated last year
- Neural Engine, 16 input channels☆16Oct 31, 2022Updated 3 years ago
- ☆19Nov 30, 2025Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 计算机毕业设计之Python+Vue.js协同过滤算法混合新闻推荐系统 新闻网站 新闻发布系统☆16Dec 27, 2021Updated 4 years ago
- whatever it means☆15Updated this week
- ☆27Jul 13, 2022Updated 3 years ago
- Streamlit中文翻译与教程☆17Dec 26, 2021Updated 4 years ago
- ☆42Feb 14, 2026Updated last month
- Welcome to the official repository of AC-LORA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs, a mechanism that provides tr…☆20Nov 14, 2025Updated 4 months ago
- Bjontegaard metric computation in the python language☆17Oct 9, 2020Updated 5 years ago
- TabletopGen: Instance-Level Interactive 3D Tabletop Scene Generation from Text or Single Image☆80Dec 30, 2025Updated 2 months ago
- LLM inference in C/C++☆21Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆35Feb 14, 2026Updated last month
- 和风天气API调用python实现☆18May 8, 2023Updated 2 years ago
- ☆34Nov 24, 2025Updated 4 months ago
- ☆22May 22, 2024Updated last year
- Yet another `llama.cpp` Rust wrapper☆12Jun 19, 2024Updated last year
- (TPAMI 2023) TransVOD:End-to-End Video Object Detection with Spatial-Temporal Transformers (implementations of TransVOD++).☆40Jan 18, 2023Updated 3 years ago
- [Re] Object Detection Meets Knowledge Graphs☆21Feb 3, 2023Updated 3 years ago
- The TikTok Research API Wrapper is a ToolKit built in R and Python to facilitate the use of TikTok Research API☆31Dec 4, 2024Updated last year
- Implementation of Latent Replay, a Continual Learning strategy for Real-Time / On The Edge applications☆14May 7, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆29Feb 7, 2022Updated 4 years ago
- Collect your latest articles from sources such as dev.to, and then update the README.md☆11Updated this week
- ☆38Apr 9, 2025Updated 11 months ago
- 此仓库是我在学习MySQL中写下的笔记,我更倾向于初学者,所以我用通俗易懂的语句描述了MySQL的使用☆28Dec 28, 2024Updated last year
- web-crawling (with AngleSharp)☆12May 26, 2025Updated 10 months ago
- Android JNI for port of Facebook's LLaMA model in C/C++☆26Jun 7, 2023Updated 2 years ago
- Example apps for LeapSDK☆59Mar 12, 2026Updated 2 weeks ago