JoelNiklaus / loss_landscapeLinks
Code for visualizing the loss landscape of neural nets
☆10Updated 4 years ago
Alternatives and similar repositories for loss_landscape
Users that are interested in loss_landscape are comparing it to the libraries listed below
Sorting:
- Deep Learning & Information Bottleneck☆60Updated last year
- A simple Jax implementation of influence functions.☆16Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆31Updated 7 months ago
- ☆13Updated 2 years ago
- source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"☆26Updated last year
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆35Updated 2 months ago
- ☆12Updated 2 years ago
- Code to reproduce experiments from 'Does Knowledge Distillation Really Work' a paper which appeared in the 2021 NeurIPS proceedings.☆33Updated last year
- ☆16Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆57Updated last year
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆59Updated 8 months ago
- Recycling diverse models☆44Updated 2 years ago
- [ICLR2023] NTK-SAP: Improving neural network pruning by aligning training dynamics☆18Updated 2 years ago
- ☆18Updated 2 years ago
- ☆15Updated 2 years ago
- Code for the ICML 2021 and ICLR 2022 papers: Skew Orthogonal Convolutions, Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100☆18Updated 3 years ago
- Robust Principles: Architectural Design Principles for Adversarially Robust CNNs☆23Updated last year
- Data-free knowledge distillation using Gaussian noise (NeurIPS paper)☆15Updated 2 years ago
- Source code of "What can linearized neural networks actually say about generalization?☆20Updated 3 years ago
- Collect optimizer related papers, data, repositories☆91Updated 7 months ago
- [ICLR 2024] Dynamic Sparse Training with Structured Sparsity☆18Updated last year
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair☆47Updated last year
- Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift☆34Updated last year
- A curated list of Robust Machine Learning papers/articles and recent advancements.☆31Updated 2 years ago
- Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"☆71Updated last year
- [NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"☆20Updated 4 months ago
- gradient norm penalty☆40Updated last year
- Decoupled Kullback-Leibler Divergence Loss (DKL), NeurIPS 2024 / Generalized Kullback-Leibler Divergence Loss (GKL)☆44Updated 3 weeks ago
- Code for the paper "Getting a CLUE: A Method for Explaining Uncertainty Estimates"☆34Updated last year