epfml / optML-pku
summer school materials
☆46Updated 2 years ago
Alternatives and similar repositories for optML-pku
Users who are interested in optML-pku are comparing it to the repositories listed below.
- Neural Tangent Kernel Papers☆118Updated 9 months ago
- Welcome to the 'In Context Learning Theory' Reading Group☆30Updated 11 months ago
- Welcome to the Awesome Feature Learning in Deep Learning Theory Reading Group! This repository serves as a collaborative platform for sch…☆200Updated 9 months ago
- [ICML'24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".☆111Updated 3 months ago
- ☆71Updated 10 months ago
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o…☆290Updated last year
- ☆34Updated 2 years ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆66Updated last year
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆64Updated 7 months ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆376Updated this week
- ☆52Updated last year
- Collect optimizer related papers, data, repositories☆97Updated 11 months ago
- An Elegant Library for Bayesian Deep Learning in PyTorch☆26Updated 2 years ago
- Distributed K-FAC preconditioner for PyTorch☆91Updated this week
- Deep Learning & Information Bottleneck☆61Updated 2 years ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆14Updated 8 months ago
- ☆236Updated 2 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆51Updated last year
- [TPAMI 2023] Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces☆42Updated 3 years ago
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**.☆27Updated 2 years ago
- [ICML 2024] DPZero: Private Fine-Tuning of Language Models without Backpropagation☆14Updated last year
- Python implementation of Scaling Neural Tangent Kernels via Sketching and Random Features☆12Updated 3 years ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆34Updated 11 months ago
- Benchmark for bi-level optimization solvers☆49Updated 4 months ago
- source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"☆32Updated last year
- Summer course on mathematical theory of deep learning☆53Updated 6 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆50Updated 3 years ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆19Updated 3 years ago
- Bi-level Optimization for Advanced Deep Learning☆47Updated 3 years ago
- This repo contains papers, books, tutorials and resources on Riemannian optimization.☆43Updated last month