RUCAIBox / FIGALinks

[ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"

☆11

Alternatives and similar repositories for FIGA

Users that are interested in FIGA are comparing it to the libraries listed below

Sorting:

BunsenFeng / AbstainQA
AbstainQA, ACL 2024
☆26Updated 8 months ago
ADaM-BJTU / W2SG
The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”
☆16Updated last year
GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆35Updated last year
RUCAIBox / HaluEval-2.0
☆44Updated last year
hanxuhu / SeqIns
The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…
☆29Updated 7 months ago
Shentao-YANG / Preference_Grounded_Guidance
Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).
☆16Updated 5 months ago
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆62Updated 11 months ago
ZeroYuHuang / Transformer-Patcher
☆31Updated last year
Reason-Wang / NAT
[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…
☆26Updated last year
GAIR-NLP / BeHonest
BeHonest: Benchmarking Honesty in Large Language Models
☆34Updated 10 months ago
Zce1112zslx / IKE
☆41Updated last year
halfrot / ALaRM
[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"
☆25Updated last year
JasonForJoy / Model-Editing-Hurt
EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
☆35Updated last month
RUCAIBox / RLMEC
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
☆38Updated last year
ChengpengLi1003 / DotaMath
☆30Updated 6 months ago
Linear95 / DSP
Domain-specific preference (DSP) data and customized RM fine-tuning.
☆25Updated last year
hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆59Updated last year
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆60Updated 7 months ago
katiekang1998 / llm_hallucinations
☆17Updated last year
zhaochen0110 / Cotempqa
Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)
☆32Updated 11 months ago
INK-USC / FiD-ICL
"FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)
☆14Updated last year
CriticBench / CriticBench
[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
☆26Updated last year
deeplearning-wisc / args
☆40Updated last year
dannyallover / overthinking_the_truth
☆29Updated last year
ruiqi-zhong / nlparam
Augmenting Statistical Models with Natural Language Parameters
☆27Updated 9 months ago
WeiminXiong / IPR
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)
☆59Updated 8 months ago
PrasannS / rlhf-length-biases
☆28Updated last year
GAIR-NLP / Preference-Dissection
☆25Updated last year
xiye17 / TextualExplInContext
The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)
☆15Updated 2 years ago
GAIR-NLP / alignment-for-honesty
☆74Updated last year