Jiacheng-Zhu-AIML / AsymmetryLoRALinks
Preprint: Asymmetry in Low-Rank Adapters of Foundation Models
☆35Updated last year
Alternatives and similar repositories for AsymmetryLoRA
Users that are interested in AsymmetryLoRA are comparing it to the libraries listed below
Sorting:
- Rewarded soups official implementation☆58Updated last year
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆45Updated 9 months ago
- ☆49Updated last year
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆65Updated 3 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆112Updated 3 months ago
- A Sober Look at Language Model Reasoning☆75Updated last month
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Updated 2 months ago
- [ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!☆37Updated 11 months ago
- What Makes a Reward Model a Good Teacher? An Optimization Perspective☆34Updated 2 weeks ago
- Test-time-training on nearest neighbors for large language models☆44Updated last year
- ☆35Updated 6 months ago
- ☆15Updated 10 months ago
- This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".☆20Updated last month
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆96Updated last week
- ☆43Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆45Updated 8 months ago
- ☆27Updated last year
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆73Updated last month
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆74Updated 4 months ago
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆30Updated 3 months ago
- ☆30Updated last year
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"☆37Updated 4 months ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆28Updated last year
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆18Updated 10 months ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆18Updated 3 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆86Updated 8 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Updated last year
- ☆31Updated last year
- Lightweight Adapting for Black-Box Large Language Models☆22Updated last year
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Updated last year