baekrok / DASH-Direction-Aware-SHrinking
☆13, updated last year
Alternatives and similar repositories for DASH-Direction-Aware-SHrinking
Users who are interested in DASH-Direction-Aware-SHrinking are comparing it to the repositories listed below.
- ☆11, updated last month
- ☆73, updated last year
- ☆18, updated last year
- Coresets via Bilevel Optimization ☆68, updated 5 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms" ☆50, updated 4 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models ☆38, updated last year
- Simple CIFAR10 ResNet example with JAX. ☆23, updated 4 years ago
- ☆17, updated 3 months ago
- [NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features ☆60, updated 3 years ago
- Pytorch implementation of neural processes and variants ☆29, updated last year
- [UAI 2025] Official code for reproducing paper "Critical Influence of Overparameterization on Sharpness-aware Minimization" ☆19, updated 8 months ago
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022) ☆68, updated 2 years ago
- Pytorch code for experiments on Linear Transformers ☆24, updated 2 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022] ☆38, updated 3 years ago
- ☆34, updated 2 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023). ☆52, updated last year
- Code for PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization, NeurIPS 2022 ☆18, updated 3 years ago
- Continual Learning with Hypernetworks. A continual learning approach that has the flexibility to learn a dedicated set of parameters, fin… ☆170, updated 3 years ago
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode… ☆19, updated last year
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data. ☆10, updated last year
- ☆67, updated 4 years ago
- ☆34, updated 2 years ago
- Code for testing DCT plus Sparse (DCTpS) networks ☆14, updated 4 years ago
- ☆35, updated 3 years ago
- ☆18, updated 4 years ago
- Continual learning of task-specific approximations of the parameter posterior distribution via a shared hypernetwork. ☆16, updated last year
- Deep Learning & Information Bottleneck ☆63, updated 2 years ago
- Source code of "What can linearized neural networks actually say about generalization?" ☆20, updated 4 years ago
- A PyTorch implementation of "Meta-Amortized Variational Inference and Learning" (https://arxiv.org/abs/1902.01950) ☆14, updated 5 years ago
- ☆21, updated 2 years ago