TideDra / lmm-r1
View external linksLinks

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
840May 14, 2025Updated 9 months ago

Alternatives and similar repositories for lmm-r1

Users that are interested in lmm-r1 are comparing it to the libraries listed below

Sorting:

Are these results useful?