UW-Madison-Lee-Lab / Expressive_Power_of_LoRALinks
Code for "The Expressive Power of Low-Rank Adaptation".
☆20Updated last year
Alternatives and similar repositories for Expressive_Power_of_LoRA
Users that are interested in Expressive_Power_of_LoRA are comparing it to the libraries listed below
Sorting:
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Updated last year
- ☆34Updated 2 years ago
- ☆30Updated 2 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated last year
- ☆45Updated last year
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆24Updated last year
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…☆108Updated last year
- ☆20Updated last year
- ☆18Updated 10 months ago
- ☆101Updated last year
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆15Updated last year
- Self-Supervised Alignment with Mutual Information☆21Updated last year
- ☆27Updated 2 years ago
- Blog post☆17Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆78Updated 2 years ago
- The repository contains code for Adaptive Data Optimization☆25Updated 10 months ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆17Updated last year
- ☆16Updated 2 years ago
- ☆33Updated last year
- Efficient Scaling laws and collaborative pretraining.☆18Updated 3 weeks ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Updated last year
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆15Updated 2 years ago
- Exploration of automated dataset selection approaches at large scales.☆47Updated 7 months ago
- ☆20Updated last year
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆23Updated 6 months ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆84Updated 11 months ago
- Teaching Models to Express Their Uncertainty in Words☆39Updated 3 years ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆55Updated 2 years ago
- ☆31Updated last year