Welcome to the 'In Context Learning Theory' Reading Group
☆31 · Nov 8, 2024 · Updated last year
Alternatives and similar repositories for Awesome_Large_Foundation_Model_Theory
Users that are interested in Awesome_Large_Foundation_Model_Theory are comparing it to the libraries listed below.
- Welcome to the Awesome Feature Learning in Deep Learning Theory Reading Group! This repository serves as a collaborative platform for sch… ☆211 · Apr 13, 2026 · Updated 2 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024) ☆39 · Nov 1, 2024 · Updated last year
- This repo contains papers, books, tutorials and resources on Riemannian optimization. ☆63 · Mar 18, 2026 · Updated 3 months ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training" ☆19 · Feb 20, 2025 · Updated last year
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity" ☆19 · Oct 19, 2023 · Updated 2 years ago
- ☆26 · Feb 20, 2026 · Updated 3 months ago
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024) ☆15 · Oct 29, 2024 · Updated last year
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment" ☆38 · Feb 11, 2025 · Updated last year
- The official implementation of A Unified Game-Theoretic Interpretation of Adversarial Robustness. ☆22 · Jun 9, 2022 · Updated 4 years ago
- Clustered Compositional Embeddings ☆13 · Oct 25, 2023 · Updated 2 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating... ☆402 · May 29, 2026 · Updated 2 weeks ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training ☆36 · Apr 7, 2025 · Updated last year
- ☆116 · Feb 25, 2025 · Updated last year
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond" ☆16 · Feb 27, 2025 · Updated last year
- ☆12 · Sep 16, 2024 · Updated last year
- Deep convolutional tensor network ☆11 · Sep 29, 2020 · Updated 5 years ago
- some codes for plotting figures ☆12 · May 9, 2025 · Updated last year
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration. ☆13 · Apr 9, 2024 · Updated 2 years ago
- Implementations of the algorithms described in the paper: On the Convergence Theory for Hessian-Free Bilevel Algorithms. ☆11 · Nov 1, 2024 · Updated last year
- An Elegant Library for Bayesian Deep Learning in PyTorch ☆27 · Dec 19, 2022 · Updated 3 years ago
- Implementation of the Regularized Nonlinear Acceleration algorithm ☆13 · Oct 4, 2018 · Updated 7 years ago
- Find context neurons in Pythia models. ☆13 · Jun 13, 2023 · Updated 3 years ago
- ☆27 · Apr 11, 2023 · Updated 3 years ago
- Code for lin-RFM used for sparse recovery tasks ☆17 · Mar 13, 2025 · Updated last year
- AnchorAttention: Improved attention for LLMs long-context training ☆216 · Jan 15, 2025 · Updated last year
- Code for the paper "Randomly pivoted Cholesky: Practical approximation of a kernel matrix with few entry evaluations" ☆35 · Dec 4, 2025 · Updated 6 months ago
- ☆13 · Feb 2, 2022 · Updated 4 years ago
- [ACL 2025] iAgent: LLM Agent as a Shield between User and Recommender Systems ☆32 · May 23, 2025 · Updated last year
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025) ☆17 · Aug 22, 2025 · Updated 9 months ago
- Implementation of Paper: Long-term Forecasting with TiDE: Time-series Dense Encoder ☆21 · Nov 1, 2024 · Updated last year
- ☆67 · Apr 8, 2026 · Updated 2 months ago
- CNN for predicting the quality of the welding ☆14 · Mar 10, 2020 · Updated 6 years ago
- A brief and partial summary of RLHF algorithms. ☆152 · Mar 4, 2025 · Updated last year
- fast trainer for educational purposes ☆26 · Jun 4, 2026 · Updated 2 weeks ago
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o… ☆301 · Apr 10, 2024 · Updated 2 years ago
- ☆20 · Oct 3, 2019 · Updated 6 years ago
- A curated list of resources for activation engineering ☆139 · Oct 2, 2025 · Updated 8 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models" ☆23 · Mar 4, 2025 · Updated last year
- ☆12 · Jul 4, 2024 · Updated last year