pilancilab / Riemannian_Preconditioned_LoRA
Source code for the paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"
☆20 Updated 6 months ago
Alternatives and similar repositories for Riemannian_Preconditioned_LoRA:
Users who are interested in Riemannian_Preconditioned_LoRA are comparing it to the repositories listed below.
- Bayesian Low-Rank Adaptation for Large Language Models ☆29 Updated 6 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024) ☆27 Updated 2 months ago
- ☆16 Updated 2 months ago
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral) ☆13 Updated 5 months ago
- ☆10 Updated 4 months ago
- PDM-based Purifier ☆19 Updated 2 months ago
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP" ☆31 Updated 2 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation ☆27 Updated last year
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆43 Updated last year
- Source code of "What can linearized neural networks actually say about generalization?" ☆19 Updated 3 years ago
- Deep Learning & Information Bottleneck ☆53 Updated last year
- Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation ☆44 Updated last year
- ☆16 Updated 6 months ago
- Discover and Cure: Concept-aware Mitigation of Spurious Correlation (ICML 2023) ☆40 Updated 9 months ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" ☆24 Updated 4 months ago
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023] ☆86 Updated last year
- [NeurIPS 2024] BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models ☆21 Updated 3 weeks ago
- [ICLR2023] NTK-SAP: Improving neural network pruning by aligning training dynamics ☆18 Updated last year
- ☆27 Updated 6 months ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So… ☆16 Updated 7 months ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases" ☆15 Updated 2 years ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023] ☆25 Updated last year
- ☆27 Updated last year
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ☆31 Updated 2 months ago
- ☆15 Updated 11 months ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data. ☆9 Updated 10 months ago
- Repo for the paper: "Agree to Disagree: Diversity through Disagreement for Better Transferability" ☆35 Updated 2 years ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization" ☆17 Updated 4 months ago
- Visualization of mean field and neural tangent kernel regime ☆21 Updated 5 months ago
- Code for the paper "Efficient Dataset Distillation using Random Feature Approximation" ☆37 Updated last year