Survey Paper List - Efficient LLM and Foundation Models
☆265Sep 22, 2024Updated last year
Alternatives and similar repositories for Efficient_Foundation_Model_Survey
Users that are interested in Efficient_Foundation_Model_Survey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆102Jan 17, 2024Updated 2 years ago
- Paper list for Personal LLM Agents☆429May 8, 2024Updated 2 years ago
- Federated Few-shot Learning for Mobile NLP. Conditionally accepted by MobiCom'23.☆16Aug 18, 2023Updated 2 years ago
- [TMLR 2024] Efficient Large Language Models: A Survey☆1,257Jun 23, 2025Updated 11 months ago
- Fast Multimodal LLM on Mobile Devices☆1,515Apr 30, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- ☆67Nov 16, 2024Updated last year
- Our unique contributions are in tools/train/benchmark.☆22Apr 14, 2025Updated last year
- Efficient Multimodal Large Language Models: A Survey☆386Apr 29, 2025Updated last year
- ☆36May 28, 2024Updated last year
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆70Aug 9, 2024Updated last year
- A Stream-based LLM Agent Framework for Continuous Context Sensing and Sharing☆45Apr 25, 2026Updated 3 weeks ago
- 📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉☆5,229Apr 20, 2026Updated last month
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models☆17Jul 17, 2024Updated last year
- A curated list for Efficient Large Language Models☆2,008Jun 17, 2025Updated 11 months ago
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- Butler 是一个用于自动化服务管理和任务调度的工具项目。☆16May 16, 2026Updated last week
- A resilient distributed training framework☆100Apr 11, 2024Updated 2 years ago
- Shepherd: A foundational framework enabling federated instruction tuning for large language models☆248Jul 7, 2023Updated 2 years ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆134Feb 22, 2024Updated 2 years ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆34Aug 18, 2023Updated 2 years ago
- CVPR 2024 accepted paper, An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in He…☆68Mar 12, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code release for VTW (AAAI 2025 Oral)☆67Nov 4, 2025Updated 6 months ago
- [ACL 2021] IrEne: Interpretable Energy Prediction for Transformers☆11Sep 8, 2021Updated 4 years ago
- [ACL 2026 (Main)] LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆83Jul 14, 2025Updated 10 months ago
- Awesome LLM compression research papers and tools.☆1,833Feb 23, 2026Updated 3 months ago
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"☆30Oct 1, 2024Updated last year
- Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"☆19Dec 16, 2024Updated last year
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Feb 9, 2023Updated 3 years ago
- Pytorch implementations of Client-Customized Adaptation for Parameter-Efficient Federated Learning (Findings of ACL: ACL 2023)☆17Oct 9, 2023Updated 2 years ago
- [ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆126Jul 6, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Generated geosite.dat based on Antifilter Community List☆27May 17, 2026Updated last week
- [AAAI 2024] FedDAT: An Approach for Foundation Model Finetuning in Multi-Modal Heterogeneous Federated Learning☆66Jan 21, 2024Updated 2 years ago
- Work in progress LLM framework.☆16Oct 31, 2024Updated last year
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models☆20Oct 2, 2024Updated last year
- codebase for "MELTing Point: Mobile Evaluation of Language Transformers"☆19Jul 19, 2024Updated last year
- a curated list of high-quality papers on resource-efficient LLMs 🌱☆162Mar 15, 2025Updated last year
- ☆85Oct 9, 2024Updated last year