mansheej / icl-task-diversity
Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression"
☆20 Updated last year
Alternatives and similar repositories for icl-task-diversity:
Users who are interested in icl-task-diversity are comparing it to the libraries listed below:
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici… ☆100 Updated last year
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature ☆120 Updated 5 months ago
- ☆58 Updated 3 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆43 Updated last year
- ☆82 Updated 11 months ago
- Efficient empirical NTKs in PyTorch ☆18 Updated 2 years ago
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le… ☆89 Updated 3 years ago
- ☆34 Updated 11 months ago
- ☆62 Updated last month
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch. ☆48 Updated last year
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We… ☆46 Updated last year
- ☆27 Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆74 Updated last year
- ☆28 Updated last year
- ☆51 Updated 7 months ago
- An Investigation of Why Overparameterization Exacerbates Spurious Correlations ☆30 Updated 4 years ago
- ☆105 Updated 2 years ago
- ☆20 Updated 3 months ago
- ☆63 Updated 2 years ago
- ☆27 Updated 6 months ago
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode… ☆14 Updated last month
- ☆44 Updated last year
- Sparse Autoencoder Training Library ☆38 Updated 2 months ago
- ☆35 Updated 2 years ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms. ☆55 Updated 2 weeks ago
- ☆30 Updated 3 weeks ago
- Distilling Model Failures as Directions in Latent Space ☆46 Updated last year
- nanoGPT-like codebase for LLM training ☆83 Updated this week
- PyTorch code for experiments on Linear Transformers ☆16 Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆34 Updated last year