roymiles / VeLoRA
[NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections
☆20Updated 6 months ago
Alternatives and similar repositories for VeLoRA:
Users that are interested in VeLoRA are comparing it to the libraries listed below
- ☆10Updated 2 months ago
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning " (NeurIPS 2022 Outstanding Paper Award)☆56Updated last year
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆30Updated 6 months ago
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)☆16Updated 4 months ago
- Data distillation benchmark☆58Updated last week
- LCA-on-the-line (ICML 2024 Oral)☆11Updated 2 months ago
- ☆21Updated 2 years ago
- Neural-etwork-parameters-with-Diffusion☆24Updated 10 months ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆77Updated last year
- Code for CVPR 2024 Oral "Neural Lineage"☆16Updated 10 months ago
- ☆20Updated last month
- Adapting LLaMA Decoder to Vision Transformer☆28Updated 11 months ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]☆28Updated last year
- Model Merging with SVD to Tie the KnOTS [ICLR 2025]☆51Updated 2 weeks ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆20Updated 4 months ago
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆22Updated 3 weeks ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆54Updated 4 months ago
- ☆41Updated 5 months ago
- Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…☆47Updated last year
- Code for T-MARS data filtering☆35Updated last year
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆29Updated last year
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆32Updated last year
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆53Updated 7 months ago
- ☆13Updated 6 months ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆21Updated 7 months ago
- Repo for the paper "Extrapolating from a Single Image to a Thousand Classes using Distillation"☆36Updated 9 months ago
- ☆38Updated last year
- LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters☆31Updated last month
- [ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries☆29Updated 10 months ago
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆26Updated last year