TideDra / lmm-r1Links

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
770Updated last month

Alternatives and similar repositories for lmm-r1

Users that are interested in lmm-r1 are comparing it to the libraries listed below

Sorting: