uds-lsv / llmft
Fine-tuning large language models with Hugging Face Transformers and DeepSpeed
☆31 Updated last year
Alternatives and similar repositories for llmft
Users who are interested in llmft are comparing it to the libraries listed below.
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment" ☆69 Updated 2 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆14 Updated 5 months ago
- ☆98 Updated last year
- ☆96 Updated last year
- ☆84 Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025] ☆31 Updated 7 months ago
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025) ☆72 Updated last year
- Exploration of automated dataset selection approaches at large scales. ☆47 Updated 5 months ago
- ☆100 Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors ☆79 Updated 8 months ago
- Code accompanying the paper Pretraining Language Models with Human Preferences ☆180 Updated last year
- Data Valuation on In-Context Examples (ACL23) ☆24 Updated 7 months ago
- Self-Supervised Alignment with Mutual Information ☆21 Updated last year
- NeurIPS 2024 tutorial on LLM Inference ☆47 Updated 8 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 Updated 11 months ago
- ☆100 Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆59 Updated 2 weeks ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou… ☆30 Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆116 Updated last year
- LLM-Merging: Building LLMs Efficiently through Merging ☆203 Updated 11 months ago
- ☆46 Updated 2 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models" ☆56 Updated 2 years ago
- ☆27 Updated 2 years ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆44 Updated 4 months ago
- ☆99 Updated last year
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf ☆24 Updated last year
- ☆35 Updated last year
- ☆20 Updated last year
- Directional Preference Alignment ☆59 Updated 11 months ago
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le… ☆94 Updated 3 years ago