A curated list of papers presenting interesting empirical studies and insights on deep learning. Continually updated.
☆392 · Jan 7, 2026 · Updated 2 months ago
Alternatives and similar repositories for awesome-deep-phenomena
Users who are interested in awesome-deep-phenomena are comparing it to the repositories listed below
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training" ☆18 · Feb 20, 2025 · Updated last year
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers… ☆98 · Dec 2, 2024 · Updated last year
- Welcome to the Awesome Feature Learning in Deep Learning Theory Reading Group! This repository serves as a collaborative platform for sch… ☆206 · Dec 27, 2024 · Updated last year
- ☆25 · Feb 20, 2026 · Updated 2 weeks ago
- ☆74 · Dec 7, 2024 · Updated last year
- Welcome to the 'In Context Learning Theory' Reading Group ☆30 · Nov 8, 2024 · Updated last year
- NeurIPS'24 - LLM Safety Landscape ☆39 · Oct 21, 2025 · Updated 4 months ago
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity" ☆19 · Oct 19, 2023 · Updated 2 years ago
- awesome papers in LLM interpretability ☆609 · Aug 20, 2025 · Updated 6 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc.. ☆294 · Jan 22, 2026 · Updated last month
- Example code for the NNGeometry PyTorch library ☆10 · Aug 20, 2025 · Updated 6 months ago
- ☆13 · Oct 7, 2024 · Updated last year
- ☆10 · Oct 20, 2023 · Updated 2 years ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024) ☆78 · Apr 3, 2024 · Updated last year
- A curated list of awesome papers on dataset distillation and related applications. ☆1,904 · Updated this week
- This is a curated list for Information Bottleneck Principle, in memory of Professor Naftali Tishby. ☆391 · Feb 12, 2026 · Updated 3 weeks ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm" ☆11 · Feb 20, 2025 · Updated last year
- Visualization of mean field and neural tangent kernel regime ☆23 · Jul 25, 2024 · Updated last year
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o… ☆294 · Apr 10, 2024 · Updated last year
- Neural Tangent Kernel Papers ☆122 · Jan 12, 2025 · Updated last year
- Towards Understanding Sharpness-Aware Minimization [ICML 2022] ☆38 · Jun 14, 2022 · Updated 3 years ago
- A curated list of awesome Deep Learning theories that shed light on the mysteries of DL ☆10 · Jul 20, 2018 · Updated 7 years ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆112 · Jun 8, 2023 · Updated 2 years ago
- [IJCAI'22 Survey] Recent Advances on Neural Network Pruning at Initialization. ☆59 · Oct 10, 2023 · Updated 2 years ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522) ☆62 · May 11, 2021 · Updated 4 years ago
- ☆17 · Mar 23, 2025 · Updated 11 months ago
- (ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code ☆16 · Jul 27, 2023 · Updated 2 years ago
- ☆18 · Jul 24, 2023 · Updated 2 years ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023] ☆29 · Sep 22, 2023 · Updated 2 years ago
- ☆34 · Jan 25, 2024 · Updated 2 years ago
- The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization". ☆85 · Jun 20, 2023 · Updated 2 years ago
- ☆17 · Feb 4, 2025 · Updated last year
- [NeurIPS 2022] The official code for our NeurIPS 2022 paper "Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnab… ☆49 · Oct 12, 2022 · Updated 3 years ago
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton. ☆75 · Aug 2, 2024 · Updated last year
- code for ICML 2021 paper in which we explore the relationship between adversarial transferability and knowledge transferability. ☆17 · Dec 8, 2022 · Updated 3 years ago
- ☆242 · May 10, 2024 · Updated last year
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads. ☆400 · Mar 2, 2025 · Updated last year
- ☆53 · May 20, 2024 · Updated last year
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training” ☆983 · Jan 30, 2024 · Updated 2 years ago