AlanAnsell / peft
☆15Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for peft
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆41Updated 10 months ago
- Scaling Sparse Fine-Tuning to Large Language Models☆17Updated 9 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆48Updated 7 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆18Updated 3 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated last week
- SILO Language Models code repository☆80Updated 8 months ago
- The repository contains code for Adaptive Data Optimization☆18Updated last month
- Tasks for describing differences between text distributions.☆16Updated 3 months ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆23Updated 9 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆61Updated 7 months ago
- Codebase for Instruction Following without Instruction Tuning☆32Updated last month
- ☆25Updated 11 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆40Updated 4 months ago
- ☆24Updated 8 months ago
- Critique-out-Loud Reward Models☆38Updated last month
- ☆15Updated 4 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆19Updated 2 months ago
- ☆36Updated 3 months ago
- Language models scale reliably with over-training and on downstream tasks☆94Updated 7 months ago
- ☆62Updated 3 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆43Updated 4 months ago
- ☆64Updated last month
- ☆38Updated 7 months ago
- ☆22Updated 2 weeks ago
- ☆47Updated 9 months ago
- ☆53Updated 3 weeks ago
- Directional Preference Alignment☆51Updated 2 months ago
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)☆53Updated last month
- Repo for ACL2023 Findings paper "Emergent Modularity in Pre-trained Transformers"☆20Updated last year
- ☆22Updated this week