Welcome to the 'In Context Learning Theory' Reading Group
☆30Nov 8, 2024Updated last year
Alternatives and similar repositories for Awesome_Large_Foundation_Model_Theory
Users that are interested in Awesome_Large_Foundation_Model_Theory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆39Nov 1, 2024Updated last year
- This repo contains papers, books, tutorials and resources on Riemannian optimization.☆58Mar 18, 2026Updated last month
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated last year
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity"☆19Oct 19, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆26Feb 20, 2026Updated last month
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- Open source code for ICML 2025 Paper: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias☆43Nov 14, 2025Updated 5 months ago
- The official implementation of A Unified Game-Theoretic Interpretation of Adversarial Robustness.☆22Jun 9, 2022Updated 3 years ago
- Clustered Compositional Embeddings☆12Oct 25, 2023Updated 2 years ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Apr 7, 2025Updated last year
- ☆114Feb 25, 2025Updated last year
- A curated list of awesome Deep Learning theories that shed light on the mysteries of DL☆10Jul 20, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Sep 16, 2024Updated last year
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"☆17Feb 27, 2025Updated last year
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Apr 9, 2024Updated 2 years ago
- ☆13Aug 7, 2023Updated 2 years ago
- Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"☆28Dec 21, 2025Updated 3 months ago
- Implementation of the Regularized Nonlinear Acceleration algorithm☆12Oct 4, 2018Updated 7 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- ☆27Apr 11, 2023Updated 3 years ago
- Examine the all the leakages happened from 2010-2017 and apply machine learning to detect equipment failure☆16Mar 3, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This project is based on Vim (paper, code) and we appreciate this excellent work.☆12Jan 13, 2025Updated last year
- Grassmannian Optimization for Tensor Completion and Tracking in the t-SVD Algebra☆11Oct 7, 2025Updated 6 months ago
- ☆18Dec 9, 2020Updated 5 years ago
- Code for lin-RFM used for sparse recovery tasks☆16Mar 13, 2025Updated last year
- AnchorAttention: Improved attention for LLMs long-context training☆216Jan 15, 2025Updated last year
- Code for the paper "Randomly pivoted Cholesky: Practical approximation of a kernel matrix with few entry evaluations"☆33Dec 4, 2025Updated 4 months ago
- This project plans the welding layers, sequence, as well as all welding points (with pose in 2d) for V-shape groove.☆15Feb 4, 2021Updated 5 years ago
- PyTorch implementation of the paper "Discovering and Explaining the Representation Bottleneck of DNNs" (ICLR 2022 Oral)☆37Oct 30, 2024Updated last year
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆62Apr 8, 2026Updated last week
- Neural Tangent Kernel Papers☆122Jan 12, 2025Updated last year
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps☆26Mar 27, 2026Updated 3 weeks ago
- A brief and partial summary of RLHF algorithms.☆148Mar 4, 2025Updated last year
- Scaling Sparse Fine-Tuning to Large Language Models☆19Jan 31, 2024Updated 2 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated 2 years ago
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o…☆295Apr 10, 2024Updated 2 years ago