RylanSchaeffer / Stanford-AI-Alignment-Double-Descent-Tutorial
Code for Arxiv Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle
☆19Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for Stanford-AI-Alignment-Double-Descent-Tutorial
- ☆24Updated last year
- ☆18Updated 6 months ago
- You should use PySR to find scaling laws. Here's an example.☆31Updated last year
- First-Order Probabilistic Programming Language☆26Updated 5 years ago
- Minimum Description Length probing for neural network representations☆16Updated 2 weeks ago
- A simple hypernetwork implementation in jax using haiku.☆23Updated 2 years ago
- Personal solutions to the Triton Puzzles☆16Updated 3 months ago
- gzip Predicts Data-dependent Scaling Laws☆32Updated 5 months ago
- ☆58Updated 2 years ago
- Pytorch implementation of SuperPolyak subgradient method.☆43Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆14Updated 3 weeks ago
- Understanding how features learned by neural networks evolve throughout training☆31Updated 3 weeks ago
- PyTorch Implementation of the paper "Towards Learning Abductive Reasoning using VSA Distributed Representations".☆12Updated 2 months ago
- Official repository for the paper "Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules" (…☆18Updated 2 years ago
- ☆40Updated 4 months ago
- Clean RL implementation using MLX☆26Updated 8 months ago
- Code for minimum-entropy coupling.☆29Updated 4 months ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆24Updated 3 weeks ago
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆13Updated last month
- ☆28Updated last year
- ☆8Updated last year
- LaTeX source code for the slides☆21Updated 3 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆34Updated last year
- Tangle Software Library☆23Updated 6 months ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆30Updated last year
- Implementation of Spectral State Space Models☆17Updated 8 months ago
- ☆22Updated this week
- we got you bro☆32Updated 3 months ago
- GL Mathematics for Numpy☆21Updated 7 months ago
- Implementations of growing and pruning in neural networks☆21Updated last year