facebookresearch / RLCD

Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment
66Updated last year

Alternatives and similar repositories for RLCD:

Users that are interested in RLCD are comparing it to the libraries listed below