roymiles / VeLoRA
[NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections
☆18Updated 5 months ago
Alternatives and similar repositories for VeLoRA:
Users that are interested in VeLoRA are comparing it to the libraries listed below
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning " (NeurIPS 2022 Outstanding Paper Award)☆55Updated last year
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆29Updated 5 months ago
- ☆21Updated 2 years ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆54Updated 3 months ago
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆26Updated last year
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆53Updated 7 months ago
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆32Updated last year
- ☆13Updated 5 months ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆29Updated last year
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]☆28Updated last year
- ☆10Updated last month
- LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters☆31Updated 3 weeks ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆20Updated last year
- Code for T-MARS data filtering☆35Updated last year
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)☆16Updated 3 months ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated last year
- [ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries☆29Updated 9 months ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆30Updated 9 months ago
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆43Updated 2 months ago
- Data distillation benchmark☆58Updated last week
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆30Updated 4 months ago
- ☆38Updated last year
- ☆14Updated last week
- LCA-on-the-line (ICML 2024 Oral)☆11Updated last month
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆19Updated 3 months ago
- Codebase for adaptive continual memory☆13Updated last year
- [BMVC 2022] Information Theoretic Representation Distillation☆18Updated last year
- SparCL: Sparse Continual Learning on the Edge @ NeurIPS 22☆29Updated last year
- ☆39Updated 4 months ago
- CLEAR benchmark (NeurIPS 2021 Dataset & Benchmark)☆26Updated last year