allen4747 / Ferret
This is the official implementation for the paper: Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models
☆15Updated 11 months ago
Alternatives and similar repositories for Ferret
Users who are interested in Ferret are comparing it to the libraries listed below.
- ☆90Updated 8 months ago
- Implementation for PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs☆22Updated last year
- [ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models☆101Updated last year
- A curated list of Model Merging methods.☆92Updated 11 months ago
- Federated Learning - PyTorch☆14Updated 4 years ago
- Repo for the paper: PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees (CVPR 2024)☆19Updated last year
- FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion (NeurIPS 2024 Spotlight)☆12Updated 5 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆35Updated 4 months ago
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆40Updated 10 months ago
- PyTorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference☆42Updated last year
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆67Updated 6 months ago
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".☆110Updated last month
- Official implementation of "TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization" (Findings of ACL …☆18Updated last month
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning☆35Updated last year
- Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning☆20Updated 2 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆88Updated 10 months ago
- [ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference☆44Updated last year
- The official implementation of the paper "Towards Efficient Mixture of Experts: A Holistic Study of Compression Techniques (TMLR)".☆74Updated 5 months ago
- ☆38Updated 10 months ago
- ☆15Updated 9 months ago
- Shepherd: A foundational framework enabling federated instruction tuning for large language models☆241Updated 2 years ago
- EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).☆67Updated last year
- Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?☆115Updated 10 months ago
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized?"☆129Updated 4 months ago
- ☆49Updated 8 months ago
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆104Updated last week
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration☆52Updated 6 months ago
- The official implementation of the paper "Does Federated Learning Really Need Backpropagation?"☆23Updated 2 years ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆15Updated last year
- Federated Transformer (NeurIPS 24): a framework to enhance the performance of multi-party Vertical Federated Learning involving fuzzy ide…☆38Updated 8 months ago