mingyin0312 / RL4LLM

RL significantly improves the reasoning capability of Qwen2.5-1.5B-Instruct
27 · Updated last month

Alternatives and similar repositories for RL4LLM:

Users who are interested in RL4LLM are comparing it to the libraries listed below.