TideDra / lmm-r1Links

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
758Updated 2 weeks ago

Alternatives and similar repositories for lmm-r1

Users that are interested in lmm-r1 are comparing it to the libraries listed below

Sorting: