JoelNiklaus / loss_landscape
Code for visualizing the loss landscape of neural nets
☆10Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for loss_landscape
- Deep Learning & Information Bottleneck☆50Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆24Updated last week
- Code to reproduce experiments from 'Does Knowledge Distillation Really Work' a paper which appeared in the 2021 NeurIPS proceedings.☆32Updated last year
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆40Updated last month
- A curated list of Robust Machine Learning papers/articles and recent advancements.☆27Updated 2 years ago
- Code for the ICML 2021 and ICLR 2022 papers: Skew Orthogonal Convolutions, Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100☆18Updated 2 years ago
- Collect optimizer related papers, data, repositories☆80Updated 11 months ago
- Official Implementation of the CVPR'23 paper 'Regularization of polynomial networks for image recognition'.☆9Updated last year
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]☆25Updated last year
- ☆15Updated last year
- ☆23Updated 2 years ago
- ☆17Updated 2 years ago
- ☆32Updated last year
- ☆34Updated 2 years ago
- ☆13Updated last year
- Official Code of The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks[ICML2022]☆14Updated 2 years ago
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆86Updated 3 months ago
- Neural Tangent Kernel Papers☆92Updated 8 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆21Updated 11 months ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆53Updated last year
- Recycling diverse models☆43Updated last year
- Optimal Transport in the Big Data Era☆93Updated last week
- Code for the paper "Efficient Dataset Distillation using Random Feature Approximation"☆36Updated last year
- ☆12Updated 2 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆49Updated 2 weeks ago
- ☆21Updated last year
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆44Updated 5 months ago
- A simple and efficient baseline for data attribution☆11Updated last year
- Bayesian Low-Rank Adaptation for Large Language Models☆27Updated 4 months ago
- (ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code☆13Updated last year