Shentao-YANG / Preference_Grounded_GuidanceView external linksLinks
Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).
☆17Jan 8, 2025Updated last year
Alternatives and similar repositories for Preference_Grounded_Guidance
Users that are interested in Preference_Grounded_Guidance are comparing it to the libraries listed below
Sorting:
- Code of Robust Lottery Tickets for Pre-trained Language Models (ACL2022)☆20Jul 18, 2022Updated 3 years ago
- ☆12Jul 4, 2024Updated last year
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆51May 12, 2025Updated 9 months ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆64Nov 27, 2024Updated last year
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆18Oct 7, 2025Updated 4 months ago
- ☆18Jun 3, 2024Updated last year
- ☆17Dec 21, 2023Updated 2 years ago
- ☆20May 12, 2022Updated 3 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- [NeurIPS 2024] Can Language Models Learn to Skip Steps?☆22Jan 25, 2025Updated last year
- ☆19Nov 8, 2023Updated 2 years ago
- Measuring if attention is explanation with ROAR☆22Mar 3, 2023Updated 2 years ago
- ☆18Mar 28, 2022Updated 3 years ago
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"☆18Dec 15, 2020Updated 5 years ago
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆24Nov 17, 2024Updated last year
- Official implementation of the paper "Interventions, Where and How? Experimental Design for Causal Models at Scale", NeurIPS 2022.☆20Jan 3, 2023Updated 3 years ago
- Tensorflow implementation of Invariant Rationalization☆50Feb 16, 2023Updated 3 years ago
- ☆53Apr 9, 2025Updated 10 months ago
- The repository for paper <Evaluating Open-QA Evaluation>☆25Apr 9, 2024Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆63Dec 25, 2023Updated 2 years ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆102Mar 15, 2023Updated 2 years ago
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)☆39Sep 8, 2025Updated 5 months ago
- ☆24May 22, 2023Updated 2 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- AbstainQA, ACL 2024☆28Feb 4, 2026Updated last week
- [EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge☆29Oct 30, 2023Updated 2 years ago
- Repo for Llatrieval☆31Aug 21, 2024Updated last year
- Code for ACL 2021 paper "Unsupervised Out-of-Domain Detection via Pre-trained Transformers"☆30Aug 20, 2021Updated 4 years ago
- NILE : Natural Language Inference with Faithful Natural Language Explanations☆30Jun 12, 2023Updated 2 years ago
- [ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process☆30Aug 2, 2024Updated last year
- ☆35Nov 17, 2021Updated 4 years ago
- ☆39May 2, 2024Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- Experiments codes for SIGIR '20 paper "A General Knowledge Distillation Framework for Counterfactual Recommendation via Uniform Data"☆35May 18, 2020Updated 5 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆38Aug 11, 2024Updated last year
- Teaching Models to Express Their Uncertainty in Words☆39May 26, 2022Updated 3 years ago
- A review of class imbalanced problems using data augumentation and ensemble learning☆10Mar 15, 2023Updated 2 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆24Jul 21, 2025Updated 6 months ago