nsfzyzz / Generalization_metrics_for_NLPLinks

[KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arxiv.org/pdf/2202.02842.pdf

☆12

Alternatives and similar repositories for Generalization_metrics_for_NLP

Users that are interested in Generalization_metrics_for_NLP are comparing it to the libraries listed below

Sorting:

YefanZhou / TempBalance
[NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training
☆35Updated 2 months ago
nsfzyzz / loss_landscape_taxonomy
[NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228
☆19Updated 3 years ago
aw31 / empirical-ntks
Efficient empirical NTKs in PyTorch
☆18Updated 3 years ago
msakarvadia / memorization
Localizing Memorized Sequences in Language Models
☆16Updated 3 months ago
pomonam / kronfluence
Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature
☆156Updated this week
gortizji / tangent_task_arithmetic
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆102Updated 2 years ago
WeiHuang05 / Awesome_Large_Foundation_Model_Theory
Welcome to the 'In Context Learning Theory' Reading Group
☆28Updated 7 months ago
alvin-zyl / CoLA
Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
☆22Updated 4 months ago
reds-lab / LAVA
This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).
☆48Updated last year
mmatena / model_merging
☆69Updated 3 years ago
locuslab / edge-of-stability
☆68Updated 6 months ago
TRAIS-Lab / dattri
`dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.
☆77Updated 2 weeks ago
mueller-mp / SAM-ON
☆34Updated last year
MadryLab / trak
A fast, effective data attribution method for neural networks in PyTorch
☆211Updated 7 months ago
rhubarbwu / linguistic-collapse
Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]
☆13Updated 2 months ago
alstonlo / torch-influence
A simple PyTorch implementation of influence functions.
☆88Updated last year
tml-epfl / sam-low-rank-features
Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]
☆28Updated last year
logix-project / logix
AI Logging for Interpretability and Explainability🔬
☆123Updated last year
r-three / mats
☆31Updated 11 months ago
ZaydH / influence_analysis_papers
Influence Analysis and Estimation - Survey, Papers, and Taxonomy
☆79Updated last year
KellerJordan / REPAIR
Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair
☆47Updated last year
zyushun / hessian-spectrum
Code for the paper: Why Transformers Need Adam: A Hessian Perspective
☆59Updated 3 months ago
ykwon0407 / DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
☆70Updated 8 months ago
IlanPrice / DCTpS
Code for testing DCT plus Sparse (DCTpS) networks
☆14Updated 4 years ago
tml-epfl / sharpness-vs-generalization
A modern look at the relationship between sharpness and generalization [ICML 2023]
☆43Updated last year
cjyaras / deep-lora-transformers
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)
☆13Updated 11 months ago
nik-dim / tall_masks
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆45Updated 8 months ago
ppope / dimensions
Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight https://openreview.net/forum?id=XJk19XzGq2J
☆69Updated last year
princeton-polaris-lab / Evaluating-Durable-Safeguards
[ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs
☆13Updated last week
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆164Updated last year