TideDra / lmm-r1

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
643Updated this week

Alternatives and similar repositories for lmm-r1:

Users that are interested in lmm-r1 are comparing it to the libraries listed below