epfml / optML-pkuLinks
summer school materials
☆46Updated 2 years ago
Alternatives and similar repositories for optML-pku
Users that are interested in optML-pku are comparing it to the libraries listed below
Sorting:
- Neural Tangent Kernel Papers☆120Updated 11 months ago
- Welcome to the 'In Context Learning Theory' Reading Group☆30Updated last year
- ☆73Updated last year
- [ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆119Updated 5 months ago
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o…☆289Updated last year
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆202Updated 11 months ago
- Collect optimizer related papers, data, repositories☆98Updated last year
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆63Updated 9 months ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆382Updated last week
- (NeurIPS 2024) QuanTA: Efficient High-Rank Fine-Tuning of LLMs with Quantum-Informed Tensor Adaptation☆34Updated last month
- Distributed K-FAC preconditioner for PyTorch☆93Updated this week
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆67Updated last year
- Deep Learning & Information Bottleneck☆62Updated 2 years ago
- ☆51Updated 2 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆51Updated last year
- ☆240Updated 2 years ago
- Source code of "What can linearized neural networks actually say about generalization?☆20Updated 4 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆50Updated 3 years ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Updated 8 months ago
- ☆34Updated 2 years ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆19Updated 3 years ago
- source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"☆33Updated last year
- [ICML 2024] DPZero: Private Fine-Tuning of Language Models without Backpropagation☆16Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆38Updated last year
- Python implementation of Scaling Neural Tangent Kernels via Sketching and Random Features☆13Updated 3 years ago
- A curated list of Model Merging methods.☆94Updated 2 weeks ago
- Summer course on mathematical theory of deep learning☆53Updated 6 years ago
- [TPAMI 2023] Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces☆42Updated 3 years ago
- Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356☆74Updated 5 years ago
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Updated 2 years ago