UCSB-NLP-Chang / Prereq_tuneLinks
Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"
☆10Updated 4 months ago
Alternatives and similar repositories for Prereq_tune
Users that are interested in Prereq_tune are comparing it to the libraries listed below
Sorting:
- Codebase for Instruction Following without Instruction Tuning☆34Updated 8 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆31Updated last year
- ☆16Updated 10 months ago
- ☆14Updated last year
- ACL24☆9Updated 11 months ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆35Updated last week
- Self-Supervised Alignment with Mutual Information☆18Updated last year
- ☆15Updated last month
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"☆13Updated 8 months ago
- ☆19Updated 3 weeks ago
- Mosaic IT: Enhancing Instruction Tuning with Data Mosaics☆18Updated 3 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆28Updated 10 months ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆32Updated last year
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆25Updated 5 months ago
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 2 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Updated 8 months ago
- AbstainQA, ACL 2024☆25Updated 7 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆38Updated last year
- Tasks for describing differences between text distributions.☆16Updated 9 months ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆23Updated 3 months ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆21Updated 6 months ago
- Restore safety in fine-tuned language models through task arithmetic☆28Updated last year
- The official implementation for Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free☆38Updated 3 weeks ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆16Updated 2 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆26Updated 7 months ago
- ☆19Updated 3 months ago
- ☆19Updated 10 months ago
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆14Updated 6 months ago
- [NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning☆30Updated 2 years ago