wtong98 / mlp-icl
☆11Updated 10 months ago
Alternatives and similar repositories for mlp-icl
Users that are interested in mlp-icl are comparing it to the libraries listed below
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness☆47Updated 4 years ago
- Source code of "What can linearized neural networks actually say about generalization?"☆20Updated 3 years ago
- ☆23Updated 2 years ago
- Code accompanying the paper "A contrastive rule for meta-learning"☆12Updated 8 months ago
- ☆37Updated last year
- ☆28Updated 2 years ago
- ☆19Updated last year
- ☆70Updated 7 months ago
- SGD with large step sizes learns sparse features [ICML 2023]☆32Updated 2 years ago
- ☆32Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆59Updated last year
- A centralized place for deep thinking code and experiments☆85Updated last year
- Omnigrok: Grokking Beyond Algorithmic Data☆58Updated 2 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule☆63Updated last year
- Code for testing DCT plus Sparse (DCTpS) networks☆14Updated 4 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- ☆19Updated last year
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 3 years ago
- ☆28Updated last week
- Parallelizing non-linear sequential models over the sequence length☆52Updated 3 weeks ago
- ☆53Updated 9 months ago
- Recycling diverse models☆45Updated 2 years ago
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.☆46Updated 5 months ago
- Distilling Model Failures as Directions in Latent Space☆47Updated 2 years ago
- Layerwise Batch Entropy Regularization☆23Updated 2 years ago
- Minimal pretraining script for language modeling in PyTorch. Supporting torch compilation and DDP. It includes a model implementation and…☆27Updated 2 weeks ago
- ☆51Updated last year
- PyTorch code for experiments on Linear Transformers☆21Updated last year