mansheej / icl-task-diversity

Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression"

☆23

Alternatives and similar repositories for icl-task-diversity

Users who are interested in icl-task-diversity are comparing it to the repositories listed below.

locuslab / edge-of-stability
☆73 Updated last year
allenbai01 / transformers-as-statisticians
☆34 Updated 2 years ago
google-research / jax-influence
☆63 Updated 3 years ago
pomonam / kronfluence
Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature
☆171 Updated 5 months ago
DeqingFu / transformers-icl-second-order
Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…
☆20 Updated last year
p-lambda / incontext-learning
Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…
☆106 Updated 2 years ago
KihoPark / linear_rep_geometry
☆110 Updated 9 months ago
tml-epfl / sharpness-vs-generalization
A modern look at the relationship between sharpness and generalization [ICML 2023]
☆43 Updated 2 years ago
aw31 / empirical-ntks
Efficient empirical NTKs in PyTorch
☆22 Updated 3 years ago
ssagawa / overparam_spur_corr
An Investigation of Why Overparameterization Exacerbates Spurious Correlations
☆30 Updated 5 years ago
joshuacnf / paradox-learning2reason
☆37 Updated 11 months ago
RobertCsordas / modules
The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…
☆46 Updated 2 years ago
mechanistic-interpretability-grokking / progress-measures-paper
☆70 Updated 3 years ago
MadryLab / trak
A fast, effective data attribution method for neural networks in PyTorch
☆222 Updated last year
dtsip / in-context-learning
☆242 Updated last year
r-three / mats
☆32 Updated last year
mega002 / ff-layers
The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…
☆99 Updated 4 years ago
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59 Updated 2 years ago
UFO-101 / auto-circuit
A library for efficient patching and automatic circuit discovery.
☆80 Updated 4 months ago
pratyushmaini / localizing-memorization
Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"
☆20 Updated 2 years ago
shauli-ravfogel / rlace-icml
☆36 Updated 3 years ago
ApolloResearch / deception-detection
☆27 Updated 9 months ago
princeton-nlp / LM-Kernel-FT
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
☆78 Updated 2 years ago
nitarshan / robust-generalization-measures
Official code for "In Search of Robust Measures of Generalization" (NeurIPS 2020)
☆28 Updated 4 years ago
TRAIS-Lab / dattri
`dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.
☆95 Updated last week
aks2203 / deep-thinking
A centralized place for deep thinking code and experiments
☆87 Updated 2 years ago
p-lambda / in-n-out
Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"
☆13 Updated 4 years ago
adamxyang / laplace-lora
Bayesian low-rank adaptation for large language models
☆27 Updated last year
tding1 / Neural-Collapse
[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features
☆59 Updated 3 years ago
edwardjhu / TP4
Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)
☆63 Updated 4 years ago