andrew-silva / mlx-rlhf
An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.
☆33 · Updated last year
Alternatives and similar repositories for mlx-rlhf
Users who are interested in mlx-rlhf are comparing it to the libraries listed below.