dunzeng / MORELinks

Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment

☆16

Alternatives and similar repositories for MORE

Users that are interested in MORE are comparing it to the libraries listed below

Sorting:

haozheji / exact-optimization
ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment
☆58Updated last year
GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆36Updated last year
janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆21Updated last year
RLHFlow / Directional-Preference-Alignment
Directional Preference Alignment
☆59Updated 10 months ago
liziniu / GEM
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)
☆35Updated 2 months ago
GXimingLu / IPA
Codebase for Inference-Time Policy Adapters
☆24Updated last year
iiis-ai / IterativeQuestionComposing
Official implementation of AAAI 2025 paper "Augmenting Math Word Problems via Iterative Question Composing"(https://arxiv.org/abs/2401.09…
☆20Updated 7 months ago
RUCAIBox / RLMEC
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
☆38Updated last year
shadowkiller33 / Contrast-Instruction
☆19Updated last year
googleinterns / localizing-paragraph-memorization
☆14Updated last year
wzq016 / PINE
Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""
☆14Updated last month
Linear95 / DSP
Domain-specific preference (DSP) data and customized RM fine-tuning.
☆25Updated last year
huiwy / reflection-on-trees
☆14Updated last year
XiangLi1999 / AutoBencher
☆29Updated last year
sail-sg / dice
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
☆44Updated 3 months ago
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences
☆71Updated last year
genrm-star / genrm-critiques
GenRM-CoT: Data release for verification rationales
☆63Updated 9 months ago
tml-epfl / icl-alignment
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆31Updated 6 months ago
Shentao-YANG / Preference_Grounded_Guidance
Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).
☆16Updated 7 months ago
gregorbachmann / Next-Token-Failures
☆89Updated last year
YuxiXie / SelfEval-Guided-Decoding
☆100Updated last year
gl-ybnbxb / BoNBoN
☆18Updated last year
princeton-nlp / WhatICLLearns
[ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning
☆21Updated 2 years ago
ruizheng20 / gpo
The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".
☆17Updated last year
deeplearning-wisc / args
☆43Updated last year
chang-github-00 / LLM-Predictive-Decoding
☆14Updated last month
LZhengisme / self-infilling
[ICML 2024] Self-Infilling Code Generation
☆18Updated last year
SIMONLQY / RethinkMCTS
☆28Updated 10 months ago
Edward-Sun / easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
☆123Updated 11 months ago
feiyang-k / AutoScale
Official Code Repository for [AutoScale–Automatic Prediction of Compute-optimal Data Compositions for Training LLMs]
☆12Updated 6 months ago