openpsi-project / ReaLHF

Super-Efficient RLHF Training of LLMs with Parameter Reallocation
85 · Updated this week

Related projects: