ash80 / RLHF_in_notebooksLinks

RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks
111Updated 3 weeks ago

Alternatives and similar repositories for RLHF_in_notebooks

Users that are interested in RLHF_in_notebooks are comparing it to the libraries listed below

Sorting: