huggingface / trl

Train transformer language models with reinforcement learning.
β˜†10,086Updated this week

Related projects β“˜

Alternatives and complementary repositories for trl