DeqingFu / transformers-icl-second-orderLinks

Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models.

☆17

Alternatives and similar repositories for transformers-icl-second-order

Users that are interested in transformers-icl-second-order are comparing it to the libraries listed below

Sorting:

dtsip / in-context-learning
☆234Updated last year
KihoPark / linear_rep_geometry
☆103Updated 5 months ago
deeplearning-wisc / args
☆43Updated last year
ajyl / dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
☆74Updated 4 months ago
p-lambda / incontext-learning
Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…
☆108Updated last year
siyan-zhao / ICL_decision_boundary
official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…
☆19Updated last week
adamkarvonen / SAE_BoardGameEval
☆23Updated 6 months ago
alexrame / rewardedsoups
Rewarded soups official implementation
☆58Updated last year
noanabeshima / matryoshka-saes
☆21Updated 8 months ago
UFO-101 / auto-circuit
A library for efficient patching and automatic circuit discovery.
☆73Updated 2 weeks ago
GFNOrg / gfn-lm-tuning
☆184Updated last year
allenbai01 / transformers-as-statisticians
☆32Updated 2 years ago
lee-ny / teaching_arithmetic
☆83Updated last year
princeton-pli / what-makes-good-rm
What Makes a Reward Model a Good Teacher? An Optimization Perspective
☆34Updated last month
ucl-dark / llm_debate
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
☆113Updated last year
tatsu-lab / linguistic_calibration
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆27Updated last year
ApolloResearch / e2e_sae
Sparse Autoencoder Training Library
☆54Updated 3 months ago
roeehendel / icl_task_vectors
☆96Updated last year
automl / is_mamba_capable_of_icl
☆18Updated last year
google-research / jax-influence
☆60Updated 3 years ago
wesg52 / universal-neurons
Universal Neurons in GPT2 Language Models
☆30Updated last year
tlc4418 / llm_optimization
A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.
☆44Updated 6 months ago
redwoodresearch / Easy-Transformer
☆121Updated last year
clarifying-EM / model-organisms-for-EM
Code repo for the model organisms and convergent directions of EM papers.
☆20Updated 2 weeks ago
explanare / ravel
Evaluate interpretability methods on localizing and disentangling concepts in LLMs.
☆52Updated 10 months ago
liziniu / GEM
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)
☆35Updated 2 months ago
stanfordnlp / axbench
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
☆112Updated last month
gregorbachmann / Next-Token-Failures
☆88Updated last year
Shentao-YANG / Preference_Grounded_Guidance
Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).
☆16Updated 6 months ago
activatedgeek / calibration-tuning
☆51Updated 3 months ago