andrew-silva / mlx-rlhf

An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.
20Updated 4 months ago

Related projects

Alternatives and complementary repositories for mlx-rlhf