DeqingFu / transformers-icl-second-order
Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models.
☆13 · Updated this week
Related projects
Alternatives and complementary repositories for transformers-icl-second-order
- Lightweight Adapting for Black-Box Large Language Models ☆18 · Updated 9 months ago
- ☆33 · Updated 9 months ago
- ☆76 · Updated 9 months ago
- ☆36 · Updated 3 months ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆54 · Updated 2 weeks ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆53 · Updated last month
- A library for efficient patching and automatic circuit discovery. ☆31 · Updated last month
- Align your LM to express calibrated verbal statements of confidence in its long-form generations. ☆19 · Updated 5 months ago
- Test-time-training on nearest neighbors for large language models ☆27 · Updated 7 months ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives". ☆14 · Updated 3 weeks ago
- ☆14 · Updated 8 months ago
- ☆127 · Updated 10 months ago
- ☆81 · Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆84 · Updated 7 months ago
- ☆49 · Updated last year
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms. ☆32 · Updated 2 weeks ago
- ☆13 · Updated 6 months ago
- ☆13 · Updated 2 months ago
- Rewarded soups official implementation ☆51 · Updated last year
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ☆26 · Updated 2 weeks ago
- ☆26 · Updated 3 weeks ago
- ☆15 · Updated 9 months ago
- ☆77 · Updated 4 months ago
- ☆70 · Updated last year
- Universal Neurons in GPT2 Language Models ☆27 · Updated 5 months ago
- Bayesian low-rank adaptation for large language models ☆23 · Updated 6 months ago
- This is an official implementation of the paper "Building Math Agents with Multi-Turn Iterative Preference Learning" with multi-turn DP… ☆15 · Updated 2 weeks ago
- ☆20 · Updated 11 months ago
- Function Vectors in Large Language Models (ICLR 2024) ☆119 · Updated last month
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023). ☆14 · Updated last year