A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...
☆397Jan 7, 2026Updated 3 months ago
Alternatives and similar repositories for awesome-deep-phenomena
Users who are interested in awesome-deep-phenomena are comparing it to the libraries listed below.
- A curated list of trustworthy deep learning papers. Daily updating...☆384Apr 7, 2026Updated last week
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated last year
- ☆26Feb 20, 2026Updated 2 months ago
- ☆74Dec 7, 2024Updated last year
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…☆98Dec 2, 2024Updated last year
- [NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features☆62Jul 19, 2022Updated 3 years ago
- ☆13Mar 29, 2024Updated 2 years ago
- NeurIPS'24 - LLM Safety Landscape☆39Oct 21, 2025Updated 5 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆80Apr 3, 2024Updated 2 years ago
- Neural Tangent Kernel Papers☆122Jan 12, 2025Updated last year
- A curated list of awesome Deep Learning theories that shed light on the mysteries of DL☆10Jul 20, 2018Updated 7 years ago
- awesome papers in LLM interpretability☆616Aug 20, 2025Updated 7 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- [NeurIPS 2022] The official code for our NeurIPS 2022 paper "Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnab…☆48Oct 12, 2022Updated 3 years ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆301Jan 22, 2026Updated 2 months ago
- This is a curated list for Information Bottleneck Principle, in memory of Professor Naftali Tishby.☆393Feb 12, 2026Updated 2 months ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆62May 11, 2021Updated 4 years ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o…☆295Apr 10, 2024Updated 2 years ago
- This is the official implementation of ICCV 2023 paper "No Fear of Classifier Biases: Neural Collapse Inspired Federated Learning with Syn…☆29Oct 27, 2023Updated 2 years ago
- [IJCAI'22 Survey] Recent Advances on Neural Network Pruning at Initialization.☆59Oct 10, 2023Updated 2 years ago
- ☆17Feb 4, 2025Updated last year
- ☆17Mar 23, 2025Updated last year
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning☆28Mar 23, 2025Updated last year
- ☆244May 10, 2024Updated last year
- A fusion of a linear layer and a cross entropy loss, written for PyTorch in Triton.☆75Aug 2, 2024Updated last year
- ☆10Oct 20, 2023Updated 2 years ago
- Collection of Reverse Engineering in Large Model☆36Jan 8, 2025Updated last year
- Minimal open-source implementation of AlphaProof and HyperTree Proof Search.☆78Updated this week
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆38Jun 14, 2022Updated 3 years ago
- Visualization of mean field and neural tangent kernel regime☆23Jul 25, 2024Updated last year
- Example code for the NNGeometry PyTorch library☆10Aug 20, 2025Updated 7 months ago
- An Open Source Implementation of Anthropic's Paper: "Towards Monosemanticity: Decomposing Language Models with Dictionary Learning"☆60May 12, 2024Updated last year
- Editing Models with Task Arithmetic☆537Jan 11, 2024Updated 2 years ago
- nanoGPT-like codebase for LLM training☆117Nov 7, 2025Updated 5 months ago
- EPFL Course - Optimization for Machine Learning - CS-439☆1,424Updated this week
- The nnsight package enables interpreting and manipulating the internals of deep learned models.☆893Updated this week
- Official Code for ICLR2022 Paper: Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation Overlap☆28Sep 28, 2025Updated 6 months ago
- ☆20Nov 27, 2022Updated 3 years ago