nsfzyzz / Generalization_metrics_for_NLPView external linksLinks
[KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arxiv.org/pdf/2202.02842.pdf
☆12Oct 17, 2022Updated 3 years ago
Alternatives and similar repositories for Generalization_metrics_for_NLP
Users that are interested in Generalization_metrics_for_NLP are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Apr 7, 2025Updated 10 months ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆20Jan 7, 2022Updated 4 years ago
- Open source code for ICML 2025 Paper: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias☆43Nov 14, 2025Updated 2 months ago
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆31Jun 9, 2025Updated 8 months ago
- Localizing Memorized Sequences in Language Models☆20Oct 15, 2025Updated 3 months ago
- ☆18Nov 10, 2024Updated last year
- ☆18Mar 25, 2021Updated 4 years ago
- Welcome to the 'In Context Learning Theory' Reading Group☆30Nov 8, 2024Updated last year
- H3M-SSMoEs: Hypergraph-based Multimodal Learning with LLM Reasoning and Style-Structured Mixture of Experts☆24Nov 1, 2025Updated 3 months ago
- ☆12Aug 21, 2020Updated 5 years ago
- ☆13Sep 13, 2015Updated 10 years ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11Oct 10, 2025Updated 4 months ago
- Reproducing TracIn (Tracing Gradient Descent) using PyTorch☆11Nov 17, 2021Updated 4 years ago
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated last year
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 9 months ago
- V-Mapper -☆10Aug 6, 2023Updated 2 years ago
- ☆13Apr 13, 2021Updated 4 years ago
- ☆12Mar 4, 2024Updated last year
- Code for Characterizing Scaling and Transfer Learning Behavior of FNO in SciML☆51May 31, 2023Updated 2 years ago
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps☆25Jan 22, 2026Updated 3 weeks ago
- ☆11May 10, 2018Updated 7 years ago
- Neural network approximators of linear algebra operations on GPU with PyTorch☆17May 30, 2022Updated 3 years ago
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 4 years ago
- ☆13Dec 21, 2021Updated 4 years ago
- Generates and optimizes Haiku system and user prompts for classification☆14Oct 27, 2025Updated 3 months ago
- Implementation of Poincare Embedding in PyTorch☆13Jul 27, 2017Updated 8 years ago
- Landscaper is a comprehensive Python framework designed for exploring the loss landscapes of deep learning models.☆24Jan 27, 2026Updated 2 weeks ago
- ☆11Sep 9, 2024Updated last year
- ☆12Feb 14, 2017Updated 9 years ago
- fast trainer for educational purposes☆23Feb 5, 2026Updated last week
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022).☆15Dec 13, 2022Updated 3 years ago
- Code for the paper "SelectiveNet: A Deep Neural Network with an Integrated Reject Option"☆12Jan 26, 2019Updated 7 years ago
- TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference☆14Feb 22, 2022Updated 3 years ago
- PyTorch implementation of LAMB for ImageNet/ResNet-50 training☆13May 13, 2021Updated 4 years ago
- rest2vec: Vectorizing the resting-state functional connectome using graph embedding☆10Jul 6, 2023Updated 2 years ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…☆20Jul 20, 2025Updated 6 months ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 8 months ago
- Torch implementation of orthoreg.☆15Oct 27, 2021Updated 4 years ago