jinpz / q_sharp

The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training
11 · Updated 2 months ago

Alternatives and similar repositories for q_sharp

Users who are interested in q_sharp compare it to the libraries listed below.
