nathanhu0 / CaMeLS
Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.
☆25 Updated last year
Alternatives and similar repositories for CaMeLS:
Users who are interested in CaMeLS are comparing it to the libraries listed below.
- ☆17 Updated 4 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆42 Updated last year
- ☆44 Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆32 Updated 9 months ago
- Self-Supervised Alignment with Mutual Information ☆16 Updated 8 months ago
- ☆27 Updated 11 months ago
- Few-shot Learning with Auxiliary Data ☆26 Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025] ☆29 Updated 3 weeks ago
- ☆39 Updated 2 years ago
- Tasks for describing differences between text distributions. ☆16 Updated 6 months ago
- Adding new tasks to T0 without catastrophic forgetting ☆32 Updated 2 years ago
- Efficient Scaling laws and collaborative pretraining. ☆14 Updated 2 weeks ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆37 Updated 2 years ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf ☆22 Updated last year
- InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc… ☆31 Updated 8 months ago
- ☆22 Updated 2 years ago
- Data Valuation on In-Context Examples (ACL23) ☆23 Updated last month
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs" ☆21 Updated 5 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆51 Updated 10 months ago
- Long Context Extension and Generalization in LLMs ☆48 Updated 4 months ago
- ☆80 Updated 11 months ago
- Augmenting Statistical Models with Natural Language Parameters ☆22 Updated 4 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆19 Updated 5 months ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca… ☆58 Updated last year
- ☆19 Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆74 Updated last year
- This repository contains data, code and models for contextual noncompliance. ☆20 Updated 6 months ago
- ☆44 Updated 6 months ago
- ☆25 Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆42 Updated 6 months ago