andrew-silva / mlx-rlhfLinks

An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.
29Updated last year

Alternatives and similar repositories for mlx-rlhf

Users that are interested in mlx-rlhf are comparing it to the libraries listed below

Sorting: