RylanSchaeffer / Stanford-AI-Alignment-Double-Descent-Tutorial
Code for Arxiv Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle
☆26Updated last year
Alternatives and similar repositories for Stanford-AI-Alignment-Double-Descent-Tutorial:
Users that are interested in Stanford-AI-Alignment-Double-Descent-Tutorial are comparing it to the libraries listed below
- ☆26Updated last year
- ☆15Updated last year
- Recycling diverse models☆44Updated 2 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆36Updated 2 years ago
- Official repo of Progressive Data Expansion: data, code and evaluation☆28Updated last year
- ☆29Updated last year
- ☆49Updated last year
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated last year
- Deep Networks Grok All the Time and Here is Why☆34Updated 11 months ago
- Quantification of Uncertainty with Adversarial Models☆28Updated last year
- Understanding how features learned by neural networks evolve throughout training☆34Updated 6 months ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated last year
- Google Research☆46Updated 2 years ago
- Sparse and discrete interpretability tool for neural networks☆62Updated last year
- we got you bro☆35Updated 8 months ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆11Updated last year
- Unofficial implementation of Conformal Language Modeling by Quach et al☆28Updated last year
- ☆31Updated 3 months ago
- ☆15Updated 2 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆60Updated 7 months ago
- Code for experiments on transformers using Markovian data.☆11Updated 5 months ago
- ☆17Updated 2 years ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 2 months ago
- ☆11Updated 11 months ago
- Universal Neurons in GPT2 Language Models☆27Updated 10 months ago
- ☆9Updated 2 years ago
- Portfolio REgret for Confidence SEquences☆14Updated 4 months ago
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆36Updated last year
- ☆18Updated last month
- ModelDiff: A Framework for Comparing Learning Algorithms☆56Updated last year