mnoukhov / async_rlhf

Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models
11Updated this week

Related projects

Alternatives and complementary repositories for async_rlhf