epfml / optML-pkuLinks
summer school materials
☆44Updated 2 years ago
Alternatives and similar repositories for optML-pku
Users that are interested in optML-pku are comparing it to the libraries listed below
Sorting:
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆189Updated 7 months ago
- Neural Tangent Kernel Papers☆115Updated 6 months ago
- Welcome to the 'In Context Learning Theory' Reading Group☆29Updated 9 months ago
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o…☆281Updated last year
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆109Updated last month
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆66Updated 10 months ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆337Updated 3 weeks ago
- ☆70Updated 8 months ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆60Updated 4 months ago
- ☆232Updated 2 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆49Updated 3 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆48Updated last year
- Deep Learning & Information Bottleneck☆61Updated 2 years ago
- Easily download anonymous Github repositories from https://anonymous.4open.science/ with a GUI interface☆97Updated last year
- ☆52Updated last year
- A curated list of Model Merging methods.☆92Updated 10 months ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆19Updated 3 years ago
- Collect optimizer related papers, data, repositories☆94Updated 8 months ago
- Python implementation of Scaling Neural Tangent Kernels via Sketching and Random Features☆13Updated 3 years ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆35Updated 4 months ago
- This repo contains papers, books, tutorials and resources on Riemannian optimization.☆37Updated 2 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆32Updated 9 months ago
- ☆32Updated 2 years ago
- Visualization of mean field and neural tangent kernel regime☆20Updated last year
- Second-Order Fine-Tuning without Pain for LLMs: a Hessian Informed Zeroth-Order Optimizer☆16Updated 5 months ago
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024☆115Updated 8 months ago
- Bi-level Optimization for Advanced Deep Learning☆46Updated 3 years ago
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**.☆28Updated 2 years ago
- [ICML 2021] This is the official github repo for training L_inf dist nets with high certified accuracy.☆42Updated 3 years ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆18Updated 2 years ago