Lingkai-Kong / RE-Control
☆11Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for RE-Control
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆11Updated 10 months ago
- ☆75Updated 9 months ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆38Updated 9 months ago
- ☆19Updated last month
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆16Updated last year
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆52Updated 2 months ago
- Lightweight Adapting for Black-Box Large Language Models☆18Updated 8 months ago
- ☆10Updated last year
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆26Updated last week
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆85Updated last year
- ☆125Updated 9 months ago
- ☆26Updated last year
- ☆35Updated 9 months ago
- Bayesian low-rank adaptation for large language models☆23Updated 6 months ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆29Updated 8 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆34Updated 2 weeks ago
- ☆13Updated 6 months ago
- ☆29Updated this week
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆28Updated 10 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆49Updated 2 weeks ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆40Updated 6 months ago
- [ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View☆27Updated 3 weeks ago
- Rewarded soups official implementation☆50Updated last year
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆53Updated 3 months ago
- Unofficial implementation of Conformal Language Modeling by Quach et al☆29Updated last year
- ☆14Updated last year
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆13Updated 2 weeks ago
- ☆18Updated 5 months ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆121Updated 8 months ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆9Updated 8 months ago