uds-lsv / llmft
Fine-tuning large language models with Hugging Face Transformers and DeepSpeed
☆29 Updated last year
Alternatives and similar repositories for llmft:
Users who are interested in llmft are comparing it to the libraries listed below:
- ☆26 Updated 6 months ago
- ☆86 Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors ☆71 Updated 3 weeks ago
- ☆27 Updated 6 months ago
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning ☆22 Updated last year
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods. ☆23 Updated last year
- ☆44 Updated last year
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models" ☆53 Updated last year
- ☆34 Updated 11 months ago
- ☆93 Updated 6 months ago
- ☆27 Updated 10 months ago
- ☆42 Updated last year
- ☆25 Updated 8 months ago
- Directional Preference Alignment ☆54 Updated 3 months ago
- Progressive Prompts: Continual Learning for Language Models ☆91 Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale" ☆27 Updated last year
- Code for Residual Energy-Based Models for Text Generation in PyTorch. ☆23 Updated 3 years ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆58 Updated last year
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment" ☆64 Updated last year
- ☆30 Updated 2 months ago
- Code and Data for the NAACL 24 paper: MacGyver: Are Large Language Models Creative Problem Solvers? ☆24 Updated 9 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? ☆26 Updated 7 months ago
- ☆19 Updated last year
- Data Valuation on In-Context Examples (ACL23) ☆23 Updated last week
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024) ☆59 Updated 5 months ago
- ☆78 Updated 10 months ago
- ☆43 Updated 5 months ago
- Domain-specific preference (DSP) data and customized RM fine-tuning. ☆24 Updated 10 months ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning ☆34 Updated 5 months ago
- Lightweight tool to identify Data Contamination in LLMs evaluation ☆45 Updated 10 months ago