feyzaakyurek / rl4f

Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.
62Updated 4 months ago

Related projects

Alternatives and complementary repositories for rl4f