XU-YIJIE / grpo-flat

Train your grpo with zero dataset and low resources, 8bit/4bit/lora/qlora supported, multi-gpu supported ...
65Updated last month

Alternatives and similar repositories for grpo-flat:

Users that are interested in grpo-flat are comparing it to the libraries listed below