tatsu-lab / alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
782Updated 4 months ago

Related projects

Alternatives and complementary repositories for alpaca_farm