Welcome to the 'In Context Learning Theory' Reading Group
☆31 · Nov 8, 2024 · Updated last year
Alternatives and similar repositories for Awesome_Large_Foundation_Model_Theory
Users that are interested in Awesome_Large_Foundation_Model_Theory are comparing it to the libraries listed below.
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024) ☆39 · Nov 1, 2024 · Updated last year
- This repo contains papers, books, tutorials and resources on Riemannian optimization. ☆62 · Mar 18, 2026 · Updated 2 months ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training" ☆18 · Feb 20, 2025 · Updated last year
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers… ☆100 · Dec 2, 2024 · Updated last year
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity" ☆19 · Oct 19, 2023 · Updated 2 years ago
- ☆26 · Feb 20, 2026 · Updated 3 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment" ☆38 · Feb 11, 2025 · Updated last year
- Clustered Compositional Embeddings ☆13 · Oct 25, 2023 · Updated 2 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating... ☆401 · May 19, 2026 · Updated last week
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training ☆36 · Apr 7, 2025 · Updated last year
- ☆116 · Feb 25, 2025 · Updated last year
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond" ☆16 · Feb 27, 2025 · Updated last year
- ☆12 · Sep 16, 2024 · Updated last year
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration. ☆13 · Apr 9, 2024 · Updated 2 years ago
- C library to control standard industrial robots ☆11 · Jul 19, 2018 · Updated 7 years ago
- Implementations of the algorithms described in the paper: On the Convergence Theory for Hessian-Free Bilevel Algorithms. ☆11 · Nov 1, 2024 · Updated last year
- MNIST experiment from Tensorizing neural networks (Novikov et al. 2015) ☆14 · Oct 22, 2019 · Updated 6 years ago
- An Elegant Library for Bayesian Deep Learning in PyTorch ☆27 · Dec 19, 2022 · Updated 3 years ago
- Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models" ☆28 · Dec 21, 2025 · Updated 5 months ago
- Find context neurons in Pythia models. ☆13 · Jun 13, 2023 · Updated 2 years ago
- Grassmannian Optimization for Tensor Completion and Tracking in the t-SVD Algebra ☆11 · Oct 7, 2025 · Updated 7 months ago
- ☆18 · Dec 9, 2020 · Updated 5 years ago
- Code for lin-RFM used for sparse recovery tasks ☆17 · Mar 13, 2025 · Updated last year
- Code for the paper "Randomly pivoted Cholesky: Practical approximation of a kernel matrix with few entry evaluations" ☆35 · Dec 4, 2025 · Updated 5 months ago
- This project plans the welding layers, sequence, as well as all welding points (with pose in 2d) for V-shape groove. ☆15 · Feb 4, 2021 · Updated 5 years ago
- ☆13 · Feb 2, 2022 · Updated 4 years ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024) ☆58 · Nov 8, 2024 · Updated last year
- Neural Tangent Kernel Papers ☆122 · Jan 12, 2025 · Updated last year
- ☆63 · Apr 8, 2026 · Updated last month
- A brief and partial summary of RLHF algorithms. ☆151 · Mar 4, 2025 · Updated last year
- fast trainer for educational purposes ☆26 · May 4, 2026 · Updated 3 weeks ago
- Scaling Sparse Fine-Tuning to Large Language Models ☆19 · Jan 31, 2024 · Updated 2 years ago
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o… ☆299 · Apr 10, 2024 · Updated 2 years ago
- ☆20 · Oct 3, 2019 · Updated 6 years ago
- A curated list of resources for activation engineering ☆137 · Oct 2, 2025 · Updated 7 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models" ☆23 · Mar 4, 2025 · Updated last year
- ☆12 · Jul 4, 2024 · Updated last year
- All-in-One Safety Evaluation Framework ☆50 · Apr 21, 2026 · Updated last month
- Combining SOAP and MUON ☆22 · Feb 11, 2025 · Updated last year