mnoukhov / async_rlhf

Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models
13Updated this week

Related projects

Alternatives and complementary repositories for async_rlhf