RLHFlow / RAFT

This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or rejection sampling fine-tuning.
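
The data-selection step that the name "best-of-n" refers to can be illustrated with a minimal sketch. It is not taken from the repository: `best_of_n_select`, `toy_generate`, and `toy_reward` are hypothetical names, and the toy generator and reward stand in for a real policy model and reward model.

```python
import random
from typing import Callable, List, Tuple


def best_of_n_select(
    prompts: List[str],
    generate: Callable[[str], str],
    reward: Callable[[str, str], float],
    n: int = 4,
) -> List[Tuple[str, str]]:
    """For each prompt, sample n responses and keep the highest-reward one."""
    selected = []
    for prompt in prompts:
        # Draw n candidate responses from the current policy.
        candidates = [generate(prompt) for _ in range(n)]
        # Rank candidates by reward and keep only the best one.
        best = max(candidates, key=lambda resp: reward(prompt, resp))
        selected.append((prompt, best))
    return selected


# Hypothetical stand-ins for a language model and a reward model.
_rng = random.Random(0)


def toy_generate(prompt: str) -> str:
    # Pretend "sampling": append a random number of filler characters.
    return prompt + " " + "x" * _rng.randint(1, 10)


def toy_reward(prompt: str, response: str) -> float:
    # Pretend reward: longer responses score higher.
    return float(len(response))


pairs = best_of_n_select(["a", "b"], toy_generate, toy_reward, n=4)
for prompt, response in pairs:
    print(repr(prompt), "->", repr(response))
```

In an iterative setup, the selected (prompt, response) pairs would serve as supervised fine-tuning data for the policy, and the sample, rank, fine-tune loop would then repeat with the updated model.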

Alternatives and similar repositories for RAFT:

Users who are interested in RAFT also compare it to the libraries listed below.