Chongjie-Si / Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
☆131 · Updated last month
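Subspace tuning covers parameter-efficient methods, LoRA being the canonical example, that freeze the pretrained weights and train only an update confined to a low-dimensional subspace. Below is a minimal PyTorch sketch of that idea; it is a generic illustration, not this repository's API, and the `LoRALinear` class and its parameter names are hypothetical.

```python
# Minimal sketch of subspace (low-rank) tuning, LoRA-style.
# Generic illustration only -- not the Subspace-Tuning repo's API.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen base weight W0 plus a trainable rank-r update B @ A."""
    def __init__(self, in_features, out_features, r=8, alpha=16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features, bias=False)
        self.base.weight.requires_grad_(False)  # freeze the pretrained weight
        # Trainable low-rank factors spanning the tuning subspace.
        self.A = nn.Parameter(torch.randn(r, in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(out_features, r))  # zero init: no change at start
        self.scaling = alpha / r

    def forward(self, x):
        # y = x @ W0^T + scaling * x @ (B A)^T
        return self.base(x) + self.scaling * (x @ self.A.T @ self.B.T)
```

Only `A` and `B` receive gradients, so each adapted layer trains r * (in_features + out_features) parameters instead of the full in_features * out_features.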
Alternatives and similar repositories for Subspace-Tuning:
Users interested in Subspace-Tuning are comparing it to the repositories listed below.
- ☆182 · Updated 5 months ago
- Official code for our paper "LoRA-Pro: Are Low-Rank Adapters Properly Optimized?" ☆106 · Updated last week
- ☆100 · Updated 8 months ago
- [EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…" ☆92 · Updated 4 months ago
- ☆131 · Updated 8 months ago
- ☆142 · Updated 6 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning ☆175 · Updated 3 months ago
- A paper list on token merging, reduction, resampling, and dropping for MLLMs. ☆44 · Updated 2 months ago
- Survey on Data-centric Large Language Models ☆80 · Updated 8 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆55 · Updated last month
- Code release for VTW (AAAI 2025 Oral) ☆32 · Updated 2 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs ☆98 · Updated last week
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning". ☆116 · Updated 4 months ago
- LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models ☆121 · Updated 10 months ago
- PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models (NeurIPS 2024 Spotlight) ☆334 · Updated last month
- Awesome-Low-Rank-Adaptation ☆83 · Updated 5 months ago
- ☆170 · Updated 8 months ago
- [EMNLP 2023 Main] Sparse Low-rank Adaptation of Pre-trained Language Models ☆72 · Updated last year
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method ☆149 · Updated 7 months ago
- ☆139 · Updated last week
- qwen-nsa ☆42 · Updated last week
- A paper list on inference-time / test-time scaling and computing ☆127 · Updated last week
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆96 · Updated last month
- 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training ☆75 · Updated 3 months ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi… ☆102 · Updated 5 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models ☆78 · Updated last month
- Code for the ACL 2024 paper "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning" ☆19 · Updated last month
- [SIGIR'24] The official implementation code of MOELoRA. ☆153 · Updated 8 months ago
- [ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation ☆116 · Updated 2 months ago
- MMR1: Advancing the Frontiers of Multimodal Reasoning ☆145 · Updated last week