RLHFlow / RAFT
This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or rejection sampling fine-tuning.
39 | Sep 22, 2024 | Updated last year
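
The description above calls RAFT "iterative best-of-n fine-tuning". As a rough illustration of the selection step that name suggests, here is a minimal sketch, not the repository's code. The names `best_of_n_select`, `toy_generate`, and `toy_reward` are hypothetical, and the toy generator and reward are stand-ins for a real language model and reward model.

```python
from typing import Callable, List, Tuple


def best_of_n_select(
    prompts: List[str],
    generate: Callable[[str, int], List[str]],
    reward: Callable[[str, str], float],
    n: int = 4,
) -> List[Tuple[str, str]]:
    """For each prompt, sample n responses and keep the highest-reward one."""
    selected = []
    for prompt in prompts:
        candidates = generate(prompt, n)
        # Rank candidates by reward and keep only the best (rejection sampling).
        best = max(candidates, key=lambda response: reward(prompt, response))
        selected.append((prompt, best))
    return selected


# Toy stand-ins (assumptions) so the sketch runs without any model.
def toy_generate(prompt: str, n: int) -> List[str]:
    # Produces n variants of the prompt with 0..n-1 trailing "!" characters.
    return [prompt + "!" * i for i in range(n)]


def toy_reward(prompt: str, response: str) -> float:
    # Hypothetical reward: longer responses score higher.
    return float(len(response))


pairs = best_of_n_select(["hi", "ok"], toy_generate, toy_reward, n=3)
print(pairs)
```

In an iterative setup, the selected (prompt, response) pairs would presumably serve as supervised fine-tuning data, after which the updated model generates the next round of candidates.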

Alternatives and similar repositories for RAFT

Users interested in RAFT are comparing it to the libraries listed below.
