dtsip / in-context-learning
☆238 · Updated last year
Alternatives and similar repositories for in-context-learning
Users interested in in-context-learning are comparing it to the libraries listed below.
- ☆186 · Updated last year
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici… ☆108 · Updated last year
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature ☆161 · Updated 2 months ago
- ☆106 · Updated 7 months ago
- Using sparse coding to find distributed representations used by neural networks. ☆269 · Updated last year
- ☆122 · Updated last year
- A fast, effective data attribution method for neural networks in PyTorch ☆217 · Updated 9 months ago
- ☆168 · Updated 9 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆104 · Updated 2 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆77 · Updated 6 months ago
- AI Logging for Interpretability and Explainability 🔬 ☆126 · Updated last year
- Function Vectors in Large Language Models (ICLR 2024) ☆179 · Updated 4 months ago
- ☆97 · Updated last year
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms. ☆84 · Updated 3 months ago
- ☆99 · Updated last year
- ☆73 · Updated 3 years ago
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode… ☆18 · Updated 9 months ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training ☆35 · Updated 5 months ago
- ☆70 · Updated 9 months ago
- ☆83 · Updated 2 years ago
- ☆186 · Updated 2 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆128 · Updated 2 months ago
- An open-source implementation of Anthropic's paper "Towards Monosemanticity: Decomposing Language Models with Dictionary Learning" ☆49 · Updated last year
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆75 · Updated 11 months ago
- Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024) ☆43 · Updated 10 months ago
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs. ☆53 · Updated 11 months ago
- Bayesian low-rank adaptation for large language models ☆24 · Updated last year
- ☆240 · Updated 11 months ago
- ☆91 · Updated last year
- Sparse Autoencoder for Mechanistic Interpretability ☆264 · Updated last year