DeqingFu / transformers-icl-second-order
Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models.
☆18 Updated 9 months ago
Alternatives and similar repositories for transformers-icl-second-order
Users who are interested in transformers-icl-second-order are comparing it to the repositories listed below
- ☆23 Updated 7 months ago
- ☆20 Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆116 Updated last year
- ☆238 Updated last year
- ☆106 Updated 7 months ago
- ☆99 Updated last year
- ☆44 Updated last year
- Lightweight Adapting for Black-Box Large Language Models ☆23 Updated last year
- ☆83 Updated 2 years ago
- ☆33 Updated 2 years ago
- Sparse and discrete interpretability tool for neural networks ☆63 Updated last year
- ☆34 Updated 8 months ago
- ☆18 Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆128 Updated 2 months ago
- Universal Neurons in GPT2 Language Models ☆30 Updated last year
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training ☆35 Updated 5 months ago
- ☆53 Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining" ☆20 Updated 5 months ago
- Rewarded soups official implementation ☆60 Updated last year
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization ☆31 Updated 7 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆123 Updated last year
- ☆186 Updated last year
- ☆32 Updated last year
- A library for efficient patching and automatic circuit discovery. ☆76 Updated last month
- Test-time-training on nearest neighbors for large language models ☆46 Updated last year
- ☆28 Updated 7 months ago
- ☆52 Updated 5 months ago
- ☆50 Updated last year
- Sparse Autoencoder Training Library ☆54 Updated 4 months ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆77 Updated 6 months ago