andrew-silva / mlx-rlhf

An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.
22Updated 6 months ago

Alternatives and similar repositories for mlx-rlhf:

Users that are interested in mlx-rlhf are comparing it to the libraries listed below