Welcome to the 'In Context Learning Theory' Reading Group
☆31Nov 8, 2024Updated last year
Alternatives and similar repositories for Awesome_Large_Foundation_Model_Theory
Users that are interested in Awesome_Large_Foundation_Model_Theory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆205Apr 13, 2026Updated 3 weeks ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated last year
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…☆99Dec 2, 2024Updated last year
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity"☆19Oct 19, 2023Updated 2 years ago
- ☆26Feb 20, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- ☆18Jan 17, 2024Updated 2 years ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- Clustered Compositional Embeddings☆13Oct 25, 2023Updated 2 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆400Apr 21, 2026Updated 2 weeks ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Apr 7, 2025Updated last year
- ☆114Feb 25, 2025Updated last year
- ☆12Sep 16, 2024Updated last year
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"☆16Feb 27, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- some codes for ploting figures☆12May 9, 2025Updated last year
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Apr 9, 2024Updated 2 years ago
- MNIST experiment from Tensorizing neural networks (Novikov et al. 2015)☆14Oct 22, 2019Updated 6 years ago
- Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"☆28Dec 21, 2025Updated 4 months ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- Examine the all the leakages happened from 2010-2017 and apply machine learning to detect equipment failure☆16Mar 3, 2020Updated 6 years ago
- This project is based on Vim (paper, code) and we appreciate this excellent work.☆12Jan 13, 2025Updated last year
- Grassmannian Optimization for Tensor Completion and Tracking in the t-SVD Algebra☆11Oct 7, 2025Updated 7 months ago
- ☆18Dec 9, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for the paper "Randomly pivoted Cholesky: Practical approximation of a kernel matrix with few entry evaluations"☆34Dec 4, 2025Updated 5 months ago
- ☆13Feb 2, 2022Updated 4 years ago
- PyTorch implementation of the paper "Discovering and Explaining the Representation Bottleneck of DNNs" (ICLR 2022 Oral)☆37Oct 30, 2024Updated last year
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated last year
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 8 months ago
- ☆15Aug 27, 2022Updated 3 years ago
- ☆62Apr 8, 2026Updated last month
- Neural Tangent Kernel Papers☆122Jan 12, 2025Updated last year
- CNN for predicting the quality of the welding☆14Mar 10, 2020Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A brief and partial summary of RLHF algorithms.☆151Mar 4, 2025Updated last year
- Scaling Sparse Fine-Tuning to Large Language Models☆19Jan 31, 2024Updated 2 years ago
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o…☆298Apr 10, 2024Updated 2 years ago
- ☆20Oct 3, 2019Updated 6 years ago
- A curated list of resources for activation engineering☆137Oct 2, 2025Updated 7 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- ☆12Jul 4, 2024Updated last year