MaximeRobeyns / bayesian_loraLinks
Bayesian Low-Rank Adaptation for Large Language Models
☆36Updated last year
Alternatives and similar repositories for bayesian_lora
Users that are interested in bayesian_lora are comparing it to the libraries listed below
Sorting:
- Bayesian low-rank adaptation for large language models☆28Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆108Updated 2 years ago
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022)☆68Updated 2 years ago
- ☆79Updated 3 years ago
- ☆32Updated last year
- Repo for the paper: "Agree to Disagree: Diversity through Disagreement for Better Transferability"☆36Updated 3 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆38Updated 3 years ago
- Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023☆29Updated 2 years ago
- ☆34Updated last year
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆29Updated last year
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information"☆73Updated last year
- ☆73Updated last year
- Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift☆35Updated 2 years ago
- Bayesian Low-Rank Adaptation of LLMs: BLoB [NeurIPS 2024] and TFB [NeurIPS 2025]☆31Updated 3 months ago
- Source code of "What can linearized neural networks actually say about generalization?☆20Updated 4 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆52Updated last year
- source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"☆33Updated last year
- ☆47Updated 2 years ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆38Updated last year
- A simple PyTorch implementation of influence functions.☆92Updated last year
- Deep Learning & Information Bottleneck☆63Updated 2 years ago
- Code for the paper "Efficient Dataset Distillation using Random Feature Approximation"☆37Updated 2 years ago
- ☆111Updated 2 years ago
- ☆34Updated 2 years ago
- ☆35Updated 3 years ago
- Framework code with wandb, checkpointing, logging, configs, experimental protocols. Useful for fine-tuning models or training from scratc…☆153Updated 3 years ago
- A simple Jax implementation of influence functions.☆20Updated last year
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated 2 years ago
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆22Updated 3 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆47Updated last year