RLHFlow / RAFTLinks

This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or rejection sampling fine-tuning.
31Updated 8 months ago

Alternatives and similar repositories for RAFT

Users that are interested in RAFT are comparing it to the libraries listed below

Sorting: