epfml / optML-pku
summer school materials
☆44 Updated last year
Alternatives and similar repositories for optML-pku:
Users interested in optML-pku are comparing it to the repositories listed below:
- Neural Tangent Kernel Papers ☆108 Updated 3 months ago
- Welcome to the 'In Context Learning Theory' Reading Group ☆27 Updated 5 months ago
- ☆67 Updated 4 months ago
- Welcome to the Awesome Feature Learning in Deep Learning Theory Reading Group! This repository serves as a collaborative platform for sch… ☆177 Updated 3 months ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective ☆57 Updated last month
- Python implementation of Scaling Neural Tangent Kernels via Sketching and Random Features ☆13 Updated 3 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms" ☆47 Updated 3 years ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di… ☆57 Updated 6 months ago
- source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models" ☆24 Updated 10 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024) ☆30 Updated 5 months ago
- Deep Learning & Information Bottleneck ☆60 Updated last year
- Pytorch code for experiments on Linear Transformers ☆20 Updated last year
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228 ☆19 Updated 3 years ago
- ☆51 Updated last year
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark". ☆98 Updated 9 months ago
- This repo contains papers, books, tutorials and resources on Riemannian optimization. ☆31 Updated 4 months ago
- ☆31 Updated last year
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023). ☆48 Updated 10 months ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture" ☆106 Updated 4 years ago
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o… ☆272 Updated last year
- Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356 ☆68 Updated 4 years ago
- Benchmark for bi-level optimization solvers ☆44 Updated 5 months ago
- Distributed K-FAC Preconditioner for PyTorch ☆85 Updated last week
- Collect optimizer related papers, data, repositories ☆89 Updated 5 months ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022] ☆35 Updated 2 years ago
- Deep Learning Theory course ☆25 Updated 3 years ago
- Omnigrok: Grokking Beyond Algorithmic Data ☆55 Updated 2 years ago
- [ICLR2023] NTK-SAP: Improving neural network pruning by aligning training dynamics ☆18 Updated last year
- Visualization of mean field and neural tangent kernel regime ☆20 Updated 9 months ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods. ☆17 Updated 2 years ago