KAIWEILIUCC / Awesome-LLM-IoT-Papers
A collection of papers on LLM applications in the IoT field.
☆17 Updated this week
Alternatives and similar repositories for Awesome-LLM-IoT-Papers
Users who are interested in Awesome-LLM-IoT-Papers are comparing it to the repositories listed below.
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c… ☆29 Updated last year
- Official implementation for “HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Opt… ☆25 Updated 2 months ago
- [ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2 ☆270 Updated 3 months ago
- A collection of token reduction (token pruning, merging, clustering, etc.) techniques for ML/AI ☆254 Updated last week
- ☆15 Updated 8 months ago
- Survey Paper List - Efficient LLM and Foundation Models ☆258 Updated last year
- ☆18 Updated last year
- [ICCV 2025] SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs ☆79 Updated 2 months ago
- [ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference ☆276 Updated 7 months ago
- [INFOCOM 2024 Top 1 Popular Article] Agglomerative Federated Learning: Empowering Larger Model Training via End-Edge-Cloud Collaboration ☆75 Updated 11 months ago
- Unified KV Cache Compression Methods for Auto-Regressive Models ☆1,288 Updated 11 months ago
- ☆17 Updated 2 years ago
- ☆30 Updated last year
- Awesome LLMs on Device: A Comprehensive Survey ☆1,288 Updated 11 months ago
- InFi is a library for building input filters for resource-efficient inference. ☆41 Updated 2 years ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents. ☆205 Updated last week
- A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources. ☆346 Updated this week
- [Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey ☆343 Updated last month
- Code for CVPR24 Paper - Resource-Efficient Transformer Pruning for Finetuning of Large Models ☆12 Updated last month
- This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral). ☆373 Updated 4 months ago
- [ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache ☆42 Updated last year
- ☆211 Updated last year
- The framework to prune LLMs to any size and any config. ☆94 Updated last year
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations ☆238 Updated last year
- ☆102 Updated last year
- [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding ☆273 Updated last year
- MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction ☆94 Updated last year
- calflops is designed to calculate FLOPs, MACs, and parameters for all kinds of neural networks, such as Linear, CNN, RNN, GCN, Transformer… ☆905 Updated last year
- A scalable, end-to-end training pipeline for general-purpose agents ☆362 Updated 5 months ago
- YiJian-Comunity: a full-process automated large model safety evaluation tool designed for academic research ☆113 Updated this week