UW-Madison-Lee-Lab / Expressive_Power_of_LoRA
Code for "The Expressive Power of Low-Rank Adaptation".
☆20 Updated 10 months ago
Alternatives and similar repositories for Expressive_Power_of_LoRA:
Users who are interested in Expressive_Power_of_LoRA are comparing it to the repositories listed below.
- ☆17 Updated 8 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆26 Updated 10 months ago
- ☆28 Updated last year
- ☆44 Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases" ☆15 Updated 2 years ago
- ☆30 Updated 2 months ago
- ☆17 Updated 2 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025] ☆29 Updated last month
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So… ☆16 Updated 9 months ago
- ☆30 Updated last year
- ☆14 Updated 4 months ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076. ☆25 Updated last year
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆79 Updated last year
- ☆33 Updated last year
- Efficient Scaling laws and collaborative pretraining. ☆15 Updated last month
- Official repo of Progressive Data Expansion: data, code and evaluation ☆28 Updated last year
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ☆33 Updated 4 months ago
- ☆18 Updated 4 months ago
- ☆13 Updated last month
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆71 Updated 4 months ago
- ☆33 Updated last year
- The repository contains code for Adaptive Data Optimization ☆20 Updated 3 months ago
- Blog post ☆17 Updated last year
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment ☆15 Updated 7 months ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf ☆23 Updated last year
- ☆15 Updated last year
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 Updated last year
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models ☆45 Updated last month