wtong98 / mlp-icl
☆10Updated 6 months ago
Alternatives and similar repositories for mlp-icl:
Users who are interested in mlp-icl are comparing it to the libraries listed below:
- Code accompanying the paper "A contrastive rule for meta-learning"☆11Updated 5 months ago
- ☆35Updated last year
- ☆30Updated 5 months ago
- Open source code for EigenGame.☆30Updated last year
- Source code of "What can linearized neural networks actually say about generalization?"☆20Updated 3 years ago
- Recycling diverse models☆44Updated 2 years ago
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness☆44Updated 3 years ago
- Parallelizing non-linear sequential models over the sequence length☆51Updated 2 months ago
- ☆52Updated 5 months ago
- ☆27Updated 2 years ago
- Deep Learning & Information Bottleneck☆58Updated last year
- ☆22Updated 2 years ago
- ☆55Updated this week
- Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023.☆25Updated last year
- Code for "The Expressive Power of Low-Rank Adaptation".☆20Updated 11 months ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆56Updated last year
- Pytorch code for experiments on Linear Transformers☆20Updated last year
- ☆16Updated 11 months ago
- Code for "Accelerated Linearized Laplace Approximation for Bayesian Deep Learning" (ELLA, NeurIPS '22)☆16Updated 2 years ago
- Minimal pretraining script for language modeling in PyTorch. Supporting torch compilation and DDP. It includes a model implementation and…☆12Updated last week
- ☆49Updated last year
- Deep Networks Grok All the Time and Here is Why☆33Updated 10 months ago
- Unofficial implementation of Conformal Language Modeling by Quach et al☆28Updated last year
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 2 years ago
- Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift☆34Updated last year
- Code for the paper "The emergence of clusters in self-attention dynamics".☆15Updated last year
- Code to reproduce the experimental results from the paper "Active Invariant Causal Prediction: Experiment Selection Through Stability", b…☆19Updated last year
- ☆31Updated 11 months ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆9Updated last year