roymiles / VeLoRA
[NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections
☆17Updated 4 months ago
Alternatives and similar repositories for VeLoRA:
Users who are interested in VeLoRA are comparing it to the repositories listed below.
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆29Updated 4 months ago
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning" (NeurIPS 2022 Outstanding Paper Award)☆55Updated last year
- Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis"☆51Updated 5 months ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆76Updated 10 months ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆18Updated 2 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆51Updated 2 months ago
- Repo for the paper "Extrapolating from a Single Image to a Thousand Classes using Distillation"☆36Updated 7 months ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]☆26Updated last year
- [BMVC 2022] Information Theoretic Representation Distillation☆18Updated last year
- Code for T-MARS data filtering☆35Updated last year
- Implementation of the paper "Training-Free Pretrained Model Merging" (CVPR 2024).☆27Updated 11 months ago
- Codebase for adaptive continual memory☆13Updated last year
- This is a PyTorch implementation of the paper "ViP: A Differentially Private Foundation Model for Computer Vision"☆36Updated last year
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆26Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆20Updated last year
- [CVPR '24] Official implementation of the paper "Multiflow: Shifting Towards Task-Agnostic Vision-Language Pruning".☆18Updated 2 months ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆18Updated 5 months ago
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆50Updated 3 months ago
- This repository contains the code for our CVPR 2022 paper on "Non-isotropy Regularization for Proxy-based Deep Metric Learning".☆14Updated last year
- LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters☆28Updated 2 months ago
- Compress conventional Vision-Language Pre-training data☆49Updated last year
- Test-Time Distribution Normalization For Contrastively Learned Vision-language Models☆27Updated last year