RylanSchaeffer / Stanford-AI-Alignment-Double-Descent-TutorialLinks

Code for Arxiv Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle

☆27

Alternatives and similar repositories for Stanford-AI-Alignment-Double-Descent-Tutorial

Users that are interested in Stanford-AI-Alignment-Double-Descent-Tutorial are comparing it to the libraries listed below

Sorting:

facebookresearch / ModelRatatouille
Recycling diverse models
☆45Updated 2 years ago
Ping-C / optimizer
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆37Updated 2 years ago
noranta4 / ASIF
Personal implementation of ASIF by Antonio Norelli
☆25Updated last year
EleutherAI / features-across-time
Understanding how features learned by neural networks evolve throughout training
☆36Updated 9 months ago
deel-ai / Craft
👋 Code for : "CRAFT: Concept Recursive Activation FacTorization for Explainability" (CVPR 2023)
☆66Updated 2 years ago
KempnerInstitute / overcomplete
👋 Overcomplete is a Vision-based SAE Toolbox
☆71Updated last week
AhmedImtiazPrio / grok-adversarial
Deep Networks Grok All the Time and Here is Why
☆37Updated last year
MadryLab / modeldiff
ModelDiff: A Framework for Comparing Learning Algorithms
☆59Updated last year
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆63Updated last year
bilal-chughtai / rep-theory-mech-interp
☆26Updated 2 years ago
sjunhongshen / ORCA
Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"
☆71Updated last year
Bradley-Butcher / Conformers
Unofficial implementation of Conformal Language Modeling by Quach et al
☆29Updated 2 years ago
ganguli-lab / degrees-of-freedom
☆37Updated 3 years ago
bremen79 / precise
Portfolio REgret for Confidence SEquences
☆20Updated 7 months ago
facebookresearch / DejaVu
Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning
☆36Updated 2 years ago
AllanYangZhou / universal_neural_functional
☆51Updated last year
MadryLab / modelcomponents
Decomposing and Editing Predictions by Modeling Model Computation
☆138Updated last year
szc12153 / sparse_interpolated_experts
Official implementation for Sparse MetA-Tuning (SMAT)
☆18Updated last week
gregorbachmann / scaling_mlps
☆51Updated last year
SamsungSAILMontreal / PAPA
Repository for the PopulAtion Parameter Averaging (PAPA) paper
☆26Updated last year
lucidrains / AMIE-pytorch
Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind
☆66Updated 10 months ago
uclaml / PDE
Official repo of Progressive Data Expansion: data, code and evaluation
☆29Updated last year
shikaiqiu / compute-better-spent
☆53Updated 10 months ago
pomonam / jax-influence
A simple Jax implementation of influence functions.
☆17Updated last year
KindXiaoming / Omnigrok
Omnigrok: Grokking Beyond Algorithmic Data
☆61Updated 2 years ago
oripress / EntropyEnigma
Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"
☆53Updated last year
alexrame / diwa
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Updated 2 years ago
google-deepmind / conformal_training
This repository contains a Jax implementation of conformal training corresponding to the ICLR'22 paper "learning optimal conformal classi…
☆130Updated 2 years ago
visinf / fast-axiomatic-attribution
Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)
☆16Updated 2 years ago
SHI-Labs / CompactNet
☆31Updated last year