facebookresearch / rlfh-gen-div

This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity
38Updated 10 months ago

Related projects

Alternatives and complementary repositories for rlfh-gen-div