Shentao-YANG / Preference_Grounded_GuidanceLinks

Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).

☆16

Alternatives and similar repositories for Preference_Grounded_Guidance

Users that are interested in Preference_Grounded_Guidance are comparing it to the libraries listed below

Sorting:

deeplearning-wisc / args
☆45Updated last year
BunsenFeng / AbstainQA
AbstainQA, ACL 2024
☆28Updated last year
joeljang / RLPHF
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
☆110Updated 2 years ago
Re-Align / AlignTDS
Analyzing LLM Alignment via Token distribution shift
☆17Updated last year
ValueCompass / Alignment-Goal-Survey
☆29Updated last year
Linear95 / DSP
Domain-specific preference (DSP) data and customized RM fine-tuning.
☆25Updated last year
RLHFlow / Directional-Preference-Alignment
Directional Preference Alignment
☆57Updated last year
ADaM-BJTU / W2SG
The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”
☆17Updated last year
princeton-nlp / WhatICLLearns
[ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning
☆21Updated 2 years ago
Linear95 / APO
Code for ACL2024 paper - Adversarial Preference Optimization (APO).
☆57Updated last year
WANGXinyiLinda / concept-based-demonstration-selection
Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…
☆75Updated last year
WeiminXiong / IPR
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)
☆62Updated last year
liziniu / GEM
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)
☆40Updated 5 months ago
YuxiXie / SelfEval-Guided-Decoding
☆103Updated last year
yizhongw / llm-temporal-alignment
Methods and evaluation for aligning language models temporally
☆30Updated last year
dannyallover / overthinking_the_truth
☆29Updated last year
RUCAIBox / RLMEC
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
☆38Updated last year
chang-github-00 / LLM-Predictive-Decoding
☆14Updated 3 months ago
FreedomIntelligence / OVM
☆69Updated last year
gl-ybnbxb / BoNBoN
☆18Updated last year
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆63Updated last year
hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆61Updated last year
kttian / llm_factuality_tuning
☆38Updated last year
Edward-Sun / easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
☆123Updated last year
xiye17 / TextualExplInContext
The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)
☆16Updated 2 years ago
tatsu-lab / linguistic_calibration
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆27Updated last year
genrm-star / genrm-critiques
GenRM-CoT: Data release for verification rationales
☆67Updated last year
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆66Updated 11 months ago
SumilerGAO / SunGen
☆27Updated 2 years ago
Reason-Wang / NAT
[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…
☆29Updated last year