Survey Paper List - Efficient LLM and Foundation Models
☆260Sep 22, 2024Updated last year
Alternatives and similar repositories for Efficient_Foundation_Model_Survey
Users that are interested in Efficient_Foundation_Model_Survey are comparing it to the libraries listed below
Sorting:
- ☆102Jan 17, 2024Updated 2 years ago
- ☆212Jan 17, 2024Updated 2 years ago
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆30Mar 5, 2024Updated last year
- [TMLR 2024] Efficient Large Language Models: A Survey☆1,253Jun 23, 2025Updated 8 months ago
- Federated Few-shot Learning for Mobile NLP. Conditionally accepted by MobiCom'23.☆16Aug 18, 2023Updated 2 years ago
- Efficient Multimodal Large Language Models: A Survey☆388Apr 29, 2025Updated 10 months ago
- Fast Multimodal LLM on Mobile Devices☆1,412Updated this week
- ☆102Oct 2, 2024Updated last year
- ☆66Nov 16, 2024Updated last year
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- ☆35May 28, 2024Updated last year
- Unifying Specialized Visual Encoders for Video Language Models☆25Nov 22, 2025Updated 3 months ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models☆17Jul 17, 2024Updated last year
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆17May 21, 2025Updated 9 months ago
- 📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉☆5,022Updated this week
- Code release for VTW (AAAI 2025 Oral)☆64Nov 4, 2025Updated 3 months ago
- A Stream-based LLM Agent Framework for Continuous Context Sensing and Sharing☆42Oct 22, 2025Updated 4 months ago
- A curated list for Efficient Large Language Models☆1,959Jun 17, 2025Updated 8 months ago
- Shepherd: A foundational framework enabling federated instruction tuning for large language models☆246Jul 7, 2023Updated 2 years ago
- [MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Se…☆816Mar 6, 2025Updated 11 months ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Feb 9, 2023Updated 3 years ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆135Feb 22, 2024Updated 2 years ago
- CVPR 2024 accepted paper, An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in He…☆66Mar 12, 2025Updated 11 months ago
- Butler 是一个用于自动化服务管理和任务调度的工具项目。☆16Updated this week
- GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM☆177Jul 12, 2024Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆126May 7, 2024Updated last year
- Awesome LLM compression research papers and tools.☆1,786Feb 23, 2026Updated last week
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆127Jan 14, 2025Updated last year
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"☆29Oct 1, 2024Updated last year
- RLLaVA is a user-friendly framework for multi-modal RL research and optimized for resource-constrained teams.☆56Feb 22, 2026Updated last week
- Pytorch implementations of Client-Customized Adaptation for Parameter-Efficient Federated Learning (Findings of ACL: ACL 2023)☆17Oct 9, 2023Updated 2 years ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago
- ☆19Jun 21, 2025Updated 8 months ago
- Offsite-Tuning: Transfer Learning without Full Model☆386Nov 27, 2023Updated 2 years ago
- [ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning☆50May 12, 2024Updated last year
- A resilient distributed training framework☆97Apr 11, 2024Updated last year