r-three / matsLinks

☆32

Alternatives and similar repositories for mats

Users that are interested in mats are comparing it to the libraries listed below

Sorting:

mmatena / model_merging
☆80Updated 3 years ago
tml-epfl / sharpness-vs-generalization
A modern look at the relationship between sharpness and generalization [ICML 2023]
☆43Updated 2 years ago
gortizji / tangent_task_arithmetic
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆107Updated 2 years ago
Liuhong99 / implicitbiasmlmcode
☆13Updated 2 years ago
MadryLab / datamodels-data
Data for "Datamodels: Predicting Predictions with Training Data"
☆97Updated 2 years ago
varunnair18 / FISH
Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).
☆59Updated 3 years ago
tml-epfl / understanding-sam
Towards Understanding Sharpness-Aware Minimization [ICML 2022]
☆36Updated 3 years ago
UKPLab / iclr2024-model-merging
This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.
☆29Updated last year
abhishekpanigrahi1996 / Skill-Localization-by-grafting
☆51Updated last year
ykwon0407 / DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
☆77Updated last year
TRAIS-Lab / dattri
`dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.
☆95Updated last week
adamxyang / laplace-lora
Bayesian low-rank adaptation for large language models
☆27Updated last year
MadryLab / journey-TRAK
Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"
☆25Updated last year
tanganke / opcm
official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"
☆22Updated last month
MadryLab / trak
A fast, effective data attribution method for neural networks in PyTorch
☆221Updated last year
mueller-mp / SAM-ON
☆34Updated last year
yoonholee / DivDis
☆39Updated 3 years ago
VITA-Group / Junk_DNA_Hypothesis
[ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…
☆16Updated 7 months ago
milesaturpin / cot-unfaithfulness
☆51Updated 2 years ago
princeton-nlp / LM-Kernel-FT
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
☆78Updated 2 years ago
RobertCsordas / modules
The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…
☆46Updated 2 years ago
Jiacheng-Zhu-AIML / AsymmetryLoRA
Preprint: Asymmetry in Low-Rank Adapters of Foundation Models
☆37Updated last year
google-research / jax-influence
☆63Updated 3 years ago
Model-GLUE / Model-GLUE
☆18Updated last year
locuslab / acr-memorization
☆37Updated 11 months ago
anniesch / jtt
Code for "Just Train Twice: Improving Group Robustness without Training Group Information"
☆72Updated last year
izmailovpavel / spurious_feature_learning
☆46Updated 2 years ago
tding1 / Neural-Collapse
[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features
☆59Updated 3 years ago
MadryLab / DsDm
☆51Updated last year
KihoPark / linear_rep_geometry
☆110Updated 9 months ago