baekrok / DASH-Direction-Aware-SHrinking
☆10 Updated 6 months ago
Alternatives and similar repositories for DASH-Direction-Aware-SHrinking
Users who are interested in DASH-Direction-Aware-SHrinking are comparing it to the libraries listed below.
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models ☆35 Updated last year
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation? ☆12 Updated 2 years ago
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor… ☆11 Updated 2 months ago
- Simple CIFAR10 ResNet example with JAX. ☆23 Updated 4 years ago
- ☆68 Updated 6 months ago
- ☆34 Updated last year
- ☆40 Updated 2 years ago
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022) ☆66 Updated 2 years ago
- Official PyTorch implementation of "Loss-Curvature Matching for Dataset Selection and Condensation" (AISTATS 2023) ☆21 Updated 2 years ago
- [NeurIPS'22] Official Repository for Characterizing Datapoints via Second-Split Forgetting ☆15 Updated last year
- Coresets via Bilevel Optimization ☆66 Updated 4 years ago
- Continual learning of task-specific approximations of the parameter posterior distribution via a shared hypernetwork. ☆16 Updated 7 months ago
- ☆19 Updated last year
- "Scalable and Order-robust Continual Learning with Additive Parameter Decomposition", ICLR 2020 ☆23 Updated 3 years ago
- ☆26 Updated 3 years ago
- ☆25 Updated last year
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral) ☆13 Updated 11 months ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023). ☆48 Updated last year
- [ICLR'24] Official code for "C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion" ☆16 Updated last year
- Predicting Out-of-Distribution Error with the Projection Norm ☆19 Updated 2 years ago
- ☆13 Updated 3 months ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024. ☆27 Updated last year
- What Makes a Reward Model a Good Teacher? An Optimization Perspective ☆32 Updated this week
- Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023 ☆28 Updated 2 years ago
- Official PyTorch implementation for Frequency Domain-based Dataset Distillation [NeurIPS 2023] ☆30 Updated last year
- Official PyTorch implementation of DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs (ICML 2025 Oral) ☆23 Updated 2 weeks ago
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆43 Updated last year
- Repo for the paper: "Agree to Disagree: Diversity through Disagreement for Better Transferability" ☆36 Updated 2 years ago
- ☆14 Updated 4 years ago
- Deep Learning & Information Bottleneck ☆60 Updated last year