pilancilab / Riemannian_Preconditioned_LoRALinks
source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"
☆32Updated last year
Alternatives and similar repositories for Riemannian_Preconditioned_LoRA
Users that are interested in Riemannian_Preconditioned_LoRA are comparing it to the libraries listed below
Sorting:
- Deep Learning & Information Bottleneck☆62Updated 2 years ago
- ☆31Updated 5 months ago
- Bayesian Low-Rank Adaptation for Large Language Models☆36Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆36Updated last year
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆35Updated last year
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆13Updated last year
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]☆90Updated 2 years ago
- ☆20Updated 2 weeks ago
- Bayesian Low-Rank Adaptation of LLMs: BLoB [NeurIPS 2024] and TFB [NeurIPS 2025]☆30Updated last month
- ☆19Updated 7 months ago
- ☆18Updated 7 months ago
- [NeurIPS '25] Multi-Token Prediction Needs Registers☆24Updated 2 months ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Updated 10 months ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated 2 years ago
- ☆18Updated last year
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation☆42Updated last year
- Code for the paper "Efficient Dataset Distillation using Random Feature Approximation"☆37Updated 2 years ago
- ☆72Updated 11 months ago
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆46Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆51Updated last year
- [NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…☆14Updated 2 years ago
- Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"☆73Updated last year
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆39Updated 2 months ago
- ☆11Updated 3 months ago
- Unofficial Implementation of Selective Attention Transformer☆17Updated last year
- Source code of "What can linearized neural networks actually say about generalization?☆20Updated 4 years ago
- PyTorch implementation of the paper "Discovering and Explaining the Representation Bottleneck of DNNs" (ICLR 2022 Oral)☆37Updated last year
- This is the project for IRM methods☆13Updated 4 years ago
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆21Updated last month
- Pytorch code for experiments on Linear Transformers☆23Updated last year