EIT-NLP / AccuracyParadox-RLHF
[EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models". (by Yanjun Chen)
13 · Nov 11, 2024 · Updated last year

Alternatives and similar repositories for AccuracyParadox-RLHF

Users who are interested in AccuracyParadox-RLHF are comparing it to the libraries listed below.
