andrew-silva / mlx-rlhfView on GitHub
An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.
38Jun 21, 2024Updated last year

Alternatives and similar repositories for mlx-rlhf

Users that are interested in mlx-rlhf are comparing it to the libraries listed below

Sorting:

Are these results useful?