Tim-Siu / reinforcement-distillationLinks

Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"
26Updated last month

Alternatives and similar repositories for reinforcement-distillation

Users that are interested in reinforcement-distillation are comparing it to the libraries listed below

Sorting: