DeqingFu / transformers-icl-second-order
Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models.
☆11Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for transformers-icl-second-order
- ☆35Updated 9 months ago
- ☆75Updated 9 months ago
- ☆13Updated 2 months ago
- Lightweight Adapting for Black-Box Large Language Models☆18Updated 8 months ago
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆26Updated last week
- ☆34Updated 3 months ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆19Updated 5 months ago
- ☆11Updated 4 months ago
- ☆25Updated 4 months ago
- ☆125Updated 9 months ago
- ☆24Updated 4 months ago
- ☆15Updated 4 months ago
- ☆26Updated 2 weeks ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆85Updated last year
- ☆13Updated 6 months ago
- Universal Neurons in GPT2 Language Models☆26Updated 5 months ago
- ☆59Updated 2 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆52Updated last week
- ☆44Updated last year
- ☆79Updated last year
- ☆11Updated 5 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs?☆23Updated 5 months ago
- ☆73Updated 4 months ago
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"☆19Updated 11 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆29Updated 8 months ago
- ☆26Updated last year
- ☆70Updated last year
- ☆18Updated last month
- ☆14Updated 8 months ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆96Updated 5 months ago