nsfzyzz / Generalization_metrics_for_NLPLinks
[KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arxiv.org/pdf/2202.02842.pdf
☆12Updated 2 years ago
Alternatives and similar repositories for Generalization_metrics_for_NLP
Users that are interested in Generalization_metrics_for_NLP are comparing it to the libraries listed below
Sorting:
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆160Updated 2 months ago
- A fast, effective data attribution method for neural networks in PyTorch☆217Updated 9 months ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆19Updated 3 years ago
- ☆70Updated 8 months ago
- LLM finetuning in resource-constrained environments.☆51Updated last year
- Efficient empirical NTKs in PyTorch☆22Updated 3 years ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆35Updated 4 months ago
- ☆238Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆104Updated 2 years ago
- Simple CIFAR10 ResNet example with JAX.☆23Updated 4 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆360Updated last month
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆51Updated last year
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆84Updated 2 months ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]☆28Updated last year
- PyTorch implementation of Mixer-nano (#parameters is 0.67M, originally Mixer-S/16 has 18M) with 90.83 % acc. on CIFAR-10. Training from s…☆35Updated 3 years ago
- Neural Tangent Kernel Papers☆115Updated 7 months ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆176Updated last year
- Mode Connectivity and Fast Geometric Ensembles in PyTorch☆274Updated 2 years ago
- Source code of "What can linearized neural networks actually say about generalization?☆20Updated 3 years ago
- Using sparse coding to find distributed representations used by neural networks.☆265Updated last year
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers☆41Updated 6 months ago
- AI Logging for Interpretability and Explainability🔬☆124Updated last year
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆53Updated 4 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆62Updated 5 months ago
- A simple PyTorch implementation of influence functions.☆91Updated last year
- nanoGPT-like codebase for LLM training☆103Updated 3 months ago
- Distributed K-FAC preconditioner for PyTorch☆89Updated last week
- Code for steering and monitoring with concepts vectors in LLMs. https://arxiv.org/abs/2502.03708☆13Updated 3 weeks ago
- ☆238Updated 11 months ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆82Updated last year