JoelNiklaus / loss_landscape
Code for visualizing the loss landscape of neural nets
☆10 · Updated 4 years ago
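For context, the core technique behind loss-landscape visualizations of this kind is to perturb the trained weights along two random directions and evaluate the loss on a 2D grid (Li et al., 2018). Below is a minimal PyTorch sketch, not this repository's actual code; `loss_fn(model)` is an assumed user-supplied evaluator, and the per-tensor normalization here is a simplification of the paper's per-filter scheme.

```python
import torch

def random_direction(model):
    """Draw one random direction in weight space, normalized per parameter tensor."""
    direction = []
    for p in model.parameters():
        d = torch.randn_like(p)
        # Scale the random tensor to the norm of the corresponding weight.
        # (Li et al. normalize per filter; per-tensor is a simplification.)
        direction.append(d * p.norm() / (d.norm() + 1e-10))
    return direction

def loss_surface(model, loss_fn, steps=21, span=1.0):
    """Evaluate loss_fn on a 2D grid of weight-space perturbations."""
    base = [p.detach().clone() for p in model.parameters()]
    dx, dy = random_direction(model), random_direction(model)
    coords = torch.linspace(-span, span, steps)
    surface = torch.zeros(steps, steps)
    with torch.no_grad():
        for i, a in enumerate(coords):
            for j, b in enumerate(coords):
                # Move the model to w + a*dx + b*dy and record the loss there.
                for p, w, u, v in zip(model.parameters(), base, dx, dy):
                    p.copy_(w + a * u + b * v)
                surface[i, j] = loss_fn(model)
        # Restore the original trained weights.
        for p, w in zip(model.parameters(), base):
            p.copy_(w)
    return surface
```

The resulting grid can then be rendered with a standard contour or surface plot (e.g., `matplotlib.pyplot.contour`).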
Alternatives and similar repositories for loss_landscape:
Users interested in loss_landscape are comparing it to the libraries listed below.
- Source code for the paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models" ☆24 · Updated 10 months ago
- Deep Learning & Information Bottleneck ☆60 · Updated last year
- [ICLR 2024] Dynamic Sparse Training with Structured Sparsity ☆17 · Updated last year
- Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine" ☆71 · Updated last year
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023] ☆89 · Updated last year
- Recycling diverse models ☆44 · Updated 2 years ago
- Code for the paper "Efficient Dataset Distillation using Random Feature Approximation" ☆37 · Updated 2 years ago
- Robust Principles: Architectural Design Principles for Adversarially Robust CNNs ☆22 · Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024) ☆30 · Updated 5 months ago
- ☆11 · Updated 2 years ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning ☆27 · Updated last year
- A simple Jax implementation of influence functions. ☆16 · Updated last year
- Official implementation for Sparse MetA-Tuning (SMAT) ☆16 · Updated 9 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di… ☆57 · Updated 6 months ago
- Source code of "What can linearized neural networks actually say about generalization?" ☆20 · Updated 3 years ago
- Code for "The Expressive Power of Low-Rank Adaptation".☆20Updated last year
- Data-free knowledge distillation using Gaussian noise (NeurIPS paper)☆15Updated 2 years ago
- ☆17Updated 2 years ago
- Decoupled Kullback-Leibler Divergence Loss (DKL), NeurIPS 2024 / Generalized Kullback-Leibler Divergence Loss (GKL)☆44Updated 2 weeks ago
- [NeurIPS 2021] "MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge", Geng Yuan, Xiaolong Ma, Yanzhi Wang et al… ☆18 · Updated 3 years ago
- Code to reproduce experiments from 'Does Knowledge Distillation Really Work?', a paper which appeared in the NeurIPS 2021 proceedings. ☆33 · Updated last year
- This repository is the official implementation of Generalized Data Weighting via Class-level Gradient Manipulation (NeurIPS 2021) (http://… ☆24 · Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms ☆56 · Updated last year
- Official implementation of the CVPR'23 paper 'Regularization of polynomial networks for image recognition'. ☆9 · Updated last year
- ☆37 · Updated 8 months ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%) ☆24 · Updated last year
- ☆13 · Updated 2 years ago
- ☆23 · Updated 2 years ago
- Deep Networks Grok All the Time and Here is Why ☆34 · Updated 11 months ago
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation ☆44 · Updated last year