UW-Madison-Lee-Lab / Expressive_Power_of_LoRA
Code for "The Expressive Power of Low-Rank Adaptation".
☆19 Updated 9 months ago
Alternatives and similar repositories for Expressive_Power_of_LoRA:
Users who are interested in Expressive_Power_of_LoRA compare it to the repositories listed below.
- ☆16 Updated 6 months ago
- ☆17 Updated 2 years ago
- ☆14 Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases" ☆15 Updated 2 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns" ☆17 Updated 10 months ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So… ☆16 Updated 7 months ago
- The repository contains code for Adaptive Data Optimization ☆20 Updated last month
- Minimum Description Length probing for neural network representations ☆18 Updated this week
- ☆28 Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆25 Updated 9 months ago
- ☆44 Updated last year
- ☆12 Updated 2 months ago
- ☆26 Updated last year
- Efficient Scaling laws and collaborative pretraining. ☆13 Updated this week
- Latest Weight Averaging (NeurIPS HITY 2022) ☆28 Updated last year
- SGD with large step sizes learns sparse features [ICML 2023] ☆32 Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆74 Updated last year
- ☆27 Updated last year
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 Updated last year
- Official repo of Progressive Data Expansion: data, code and evaluation ☆27 Updated last year
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits" ☆13 Updated 3 months ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆59 Updated 4 months ago
- ☆12 Updated 10 months ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆79 Updated last year
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment ☆15 Updated 5 months ago
- Data Valuation on In-Context Examples (ACL23) ☆23 Updated 2 weeks ago
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆43 Updated last year
- ☆24 Updated 2 months ago
- Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023. ☆24 Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025] ☆29 Updated last week