Clin0212 / HydraLoRA
[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
☆234 · Updated last year
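The core idea named in the title is an asymmetric LoRA block: a single shared down-projection A paired with several up-projection B heads mixed by a small router. The sketch below illustrates that idea only; the class, parameter names (`rank`, `num_b_heads`, `alpha`), and the soft token-wise router are illustrative assumptions, not the repository's actual API.

```python
import torch
import torch.nn as nn


class AsymmetricLoRALinear(nn.Module):
    """Frozen linear layer plus an asymmetric LoRA update: shared A, multiple B heads, soft router."""

    def __init__(self, base: nn.Linear, rank: int = 8, num_b_heads: int = 3, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():  # keep the pretrained weight frozen
            p.requires_grad_(False)
        self.scaling = alpha / rank
        # One shared down-projection A (task-agnostic part of the update)
        self.lora_A = nn.Linear(base.in_features, rank, bias=False)
        # Several up-projection B heads (task-specific parts), zero-initialised so training starts from the base model
        self.lora_B = nn.ModuleList(
            nn.Linear(rank, base.out_features, bias=False) for _ in range(num_b_heads)
        )
        for b in self.lora_B:
            nn.init.zeros_(b.weight)
        # Router producing mixing weights over the B heads (assumed soft gating for simplicity)
        self.router = nn.Linear(base.in_features, num_b_heads, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.lora_A(x)                                # shared low-rank projection
        gate = torch.softmax(self.router(x), dim=-1)      # (..., num_b_heads)
        delta = sum(gate[..., i : i + 1] * B(h) for i, B in enumerate(self.lora_B))
        return self.base(x) + self.scaling * delta


# Usage: wrap an existing projection layer
layer = AsymmetricLoRALinear(nn.Linear(768, 768), rank=8, num_b_heads=3)
out = layer(torch.randn(2, 16, 768))
print(out.shape)  # torch.Size([2, 16, 768])
```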
Alternatives and similar repositories for HydraLoRA
Users interested in HydraLoRA are comparing it to the libraries listed below.
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning. ☆164 · Updated 6 months ago
- ☆125 · Updated last year
- Awesome-Low-Rank-Adaptation ☆126 · Updated last year
- ☆152 · Updated last year
- ☆173 · Updated last year
- ☆216 · Updated last month
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method ☆201 · Updated last year
- One-shot Entropy Minimization ☆187 · Updated 7 months ago
- Awesome Low-Rank Adaptation ☆59 · Updated 5 months ago
- [EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models ☆84 · Updated last year
- ☆62 · Updated last year
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models ☆153 · Updated 6 months ago
- ☆28 · Updated last year
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs. ☆80 · Updated 2 months ago
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models" ☆57 · Updated 8 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging ☆74 · Updated 10 months ago
- [TMLR 2025] Efficient Reasoning Models: A Survey ☆290 · Updated last week
- Paper List of Inference/Test Time Scaling/Computing ☆339 · Updated 4 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024. ☆99 · Updated last year
- ☆195 · Updated last year
- MokA: Multimodal Low-Rank Adaptation for MLLMs ☆66 · Updated 2 weeks ago
- [ICLR 2025 Oral🔥] SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning ☆75 · Updated 6 months ago
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning (NeurIPS 2024) ☆53 · Updated last year
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379) ☆45 · Updated 6 months ago
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT ☆131 · Updated 10 months ago
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized?" ☆142 · Updated 9 months ago
- 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training ☆91 · Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆89 · Updated 11 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration ☆46 · Updated last year
- Code release for VTW (AAAI 2025 Oral) ☆64 · Updated 2 months ago