liuchen6667 / qwen_grpo_gsm8kView on GitHub
简单易理解的代码,用于在qwen上使用grpo加强数学能力
50May 14, 2025Updated 9 months ago

Alternatives and similar repositories for qwen_grpo_gsm8k

Users that are interested in qwen_grpo_gsm8k are comparing it to the libraries listed below

Sorting:

Are these results useful?