Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).
☆17Jan 8, 2025Updated last year
Alternatives and similar repositories for Preference_Grounded_Guidance
Users that are interested in Preference_Grounded_Guidance are comparing it to the libraries listed below
Sorting:
- Code of Robust Lottery Tickets for Pre-trained Language Models (ACL2022)☆20Jul 18, 2022Updated 3 years ago
- ☆12Jul 4, 2024Updated last year
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆52May 12, 2025Updated 9 months ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆64Nov 27, 2024Updated last year
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆18Oct 7, 2025Updated 5 months ago
- ☆18Jun 3, 2024Updated last year
- ☆17Dec 21, 2023Updated 2 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- ☆20May 12, 2022Updated 3 years ago
- [NeurIPS 2024] Can Language Models Learn to Skip Steps?☆22Jan 25, 2025Updated last year
- ☆19Nov 8, 2023Updated 2 years ago
- Measuring if attention is explanation with ROAR☆22Mar 3, 2023Updated 3 years ago
- ☆18Mar 28, 2022Updated 3 years ago
- Official implementation of the paper "Interventions, Where and How? Experimental Design for Causal Models at Scale", NeurIPS 2022.☆20Jan 3, 2023Updated 3 years ago
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"☆18Dec 15, 2020Updated 5 years ago
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆24Nov 17, 2024Updated last year
- Tensorflow implementation of Invariant Rationalization☆50Feb 16, 2023Updated 3 years ago
- ☆53Apr 9, 2025Updated 11 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆63Dec 25, 2023Updated 2 years ago
- CausaLM: Causal Model Explanation Through Counterfactual Language Models☆56Jun 14, 2020Updated 5 years ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆103Mar 15, 2023Updated 2 years ago
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)☆39Sep 8, 2025Updated 6 months ago
- AbstainQA, ACL 2024☆29Feb 4, 2026Updated last month
- [EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge☆29Oct 30, 2023Updated 2 years ago
- Repo for Llatrieval☆31Aug 21, 2024Updated last year
- Code for ACL 2021 paper "Unsupervised Out-of-Domain Detection via Pre-trained Transformers"☆30Aug 20, 2021Updated 4 years ago
- ☆35Nov 17, 2021Updated 4 years ago
- [ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process☆30Aug 2, 2024Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- "Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems" in SIGIR'21☆35May 7, 2023Updated 2 years ago
- Experiments codes for SIGIR '20 paper "A General Knowledge Distillation Framework for Counterfactual Recommendation via Uniform Data"☆35May 18, 2020Updated 5 years ago
- Teaching Models to Express Their Uncertainty in Words☆39May 26, 2022Updated 3 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆38Aug 11, 2024Updated last year
- Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"☆16Mar 18, 2025Updated 11 months ago
- ☆11Jun 15, 2019Updated 6 years ago
- A review of class imbalanced problems using data augumentation and ensemble learning☆10Mar 15, 2023Updated 2 years ago
- ☆10Feb 17, 2019Updated 7 years ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago