uds-lsv / llmftLinks
Fine-tuning large language models with huggingface transformers and deepspeed
☆31Updated last year
Alternatives and similar repositories for llmft
Users that are interested in llmft are comparing it to the libraries listed below
Sorting:
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆69Updated last year
- Data Valuation on In-Context Examples (ACL23)☆23Updated 5 months ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs☆36Updated last year
- ☆13Updated 6 months ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆77Updated 6 months ago
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…☆37Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆30Updated 5 months ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆30Updated last month
- ☆40Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆44Updated 2 months ago
- Progressive Prompts: Continual Learning for Language Models☆94Updated 2 years ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆26Updated last year
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Updated last year
- Directional Preference Alignment☆57Updated 9 months ago
- [EMNLP 2022] Continual Training of Language Models for Few-Shot Learning☆45Updated 2 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆55Updated 2 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆75Updated last year
- ☆86Updated last year
- ☆48Updated last month
- ☆95Updated last year
- ☆28Updated last year
- ☆29Updated 11 months ago
- ☆31Updated 11 months ago
- Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…☆74Updated last year
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆62Updated 10 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 9 months ago
- ☆14Updated last year
- ☆31Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆54Updated last year