KAIWEILIUCC / Awesome-LLM-IoT-Papers
A collection of papers on LLM applications in the IoT field.
☆17Updated this week
Alternatives and similar repositories for Awesome-LLM-IoT-Papers
Users who are interested in Awesome-LLM-IoT-Papers are comparing it to the repositories listed below.
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆29Updated last year
- A collection of token reduction (token pruning, merging, clustering, etc.) techniques for ML/AI☆225Updated last week
- Official implementation for “HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Opt…☆25Updated last month
- [ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2☆266Updated 3 months ago
- ☆15Updated 7 months ago
- A curated paper list and taxonomy of efficient Vision-Language-Action (VLA) models for embodied manipulation.☆50Updated 2 weeks ago
- A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.☆336Updated this week
- ☆17Updated 2 years ago
- Survey Paper List - Efficient LLM and Foundation Models☆257Updated last year
- [ICCV 2025] SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs☆79Updated last month
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆198Updated this week
- Unified KV Cache Compression Methods for Auto-Regressive Models☆1,279Updated 10 months ago
- [Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey☆338Updated 3 weeks ago
- ☆18Updated last year
- [ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference☆272Updated 6 months ago
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"☆56Updated 10 months ago
- 📚 Collection of token-level model compression resources.☆183Updated 2 months ago
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations☆238Updated last year
- Source code for the paper: "Pantheon: Preemptible Multi-DNN Inference on Mobile Edge GPUs"☆13Updated last year
- ☆30Updated last year
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"☆557Updated 4 months ago
- ✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy☆306Updated 6 months ago
- Efficient Multimodal Large Language Models: A Survey☆375Updated 7 months ago
- calflops is designed to calculate FLOPs, MACs, and parameters in various neural networks, such as Linear, CNN, RNN, GCN, Transformer…☆899Updated last year
- [CVPR2024] ERMVP: Communication-Efficient and Collaboration-Robust Multi-Vehicle Perception in Challenging Environments☆42Updated last year
- [ICCV 2025] Official Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition"☆302Updated 10 months ago
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…☆248Updated 4 months ago
- [ACM CSUR 2025] Out-of-Distribution Detection: A Task-Oriented Survey of Recent Advances☆153Updated 3 months ago
- The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025☆271Updated 6 months ago
- Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.☆97Updated 5 months ago