WeiHuang05 / Awesome_Large_Foundation_Model_TheoryView external linksLinks
Welcome to the 'In Context Learning Theory' Reading Group
☆30Nov 8, 2024Updated last year
Alternatives and similar repositories for Awesome_Large_Foundation_Model_Theory
Users that are interested in Awesome_Large_Foundation_Model_Theory are comparing it to the libraries listed below
Sorting:
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆205Dec 27, 2024Updated last year
- This repo contains papers, books, tutorials and resources on Riemannian optimization.☆56Feb 4, 2026Updated last week
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated 11 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆39Nov 1, 2024Updated last year
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity"☆19Oct 19, 2023Updated 2 years ago
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…☆98Dec 2, 2024Updated last year
- ☆25Apr 18, 2025Updated 9 months ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated 11 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆36Feb 11, 2025Updated last year
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"☆17Feb 27, 2025Updated 11 months ago
- Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"☆26Dec 21, 2025Updated last month
- ☆27Apr 11, 2023Updated 2 years ago
- ☆106Feb 25, 2025Updated 11 months ago
- Open source code for ICML 2025 Paper: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias☆43Nov 14, 2025Updated 3 months ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆390Jan 7, 2026Updated last month
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Apr 7, 2025Updated 10 months ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- ☆54Dec 17, 2025Updated last month
- PyTorch implementation of the paper "Discovering and Explaining the Representation Bottleneck of DNNs" (ICLR 2022 Oral)☆37Oct 30, 2024Updated last year
- A curated list of resources for activation engineering☆124Oct 2, 2025Updated 4 months ago
- [NeurIPS'23] Binary Classification with Confidence Difference☆10May 13, 2024Updated last year
- pytorch☆10Apr 13, 2022Updated 3 years ago
- Towards Better Graph Representation Learning with Parameterized Decomposition & Filtering☆13Aug 22, 2023Updated 2 years ago
- Code for Semi-crowdsourced Clustering with Deep Generative Models☆12Dec 9, 2022Updated 3 years ago
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated last year
- ☆11Feb 2, 2026Updated 2 weeks ago
- Tutorials for MATH 4432 Statistical Machine Learning, HKUST, Fall 2022☆11Sep 17, 2024Updated last year
- Learning from Indirect Observations☆11Jul 16, 2021Updated 4 years ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- Implementations of the algorithms described in the paper: On the Convergence Theory for Hessian-Free Bilevel Algorithms.☆10Nov 1, 2024Updated last year
- The source code for “Homophily-Related: Adaptive Hybrid Graph Filter for Multi-View Graph Clustering”☆10Apr 10, 2024Updated last year
- A Claude Code skill for sending messages to Feishu (飞书/Lark) via Webhook.☆23Updated this week
- Simple MoE - Day 17 of 365 Days of Repos☆16Jan 17, 2025Updated last year
- Clustered Compositional Embeddings☆11Oct 25, 2023Updated 2 years ago
- Pytorch routines for (Ker)nel (Mac)hines☆10Oct 10, 2025Updated 4 months ago
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o…☆293Apr 10, 2024Updated last year
- Official Tensorflow implementation for Deep Generative Positive-Unlabeled Learning under Selection Bias (VAE-PU) in CIKM 2020.☆14Dec 11, 2021Updated 4 years ago
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- ☆12Sep 16, 2024Updated last year