OpenMOSE / RWKV-LM-RLHF

Reinforcement Learning Toolkit for RWKV.(v6,v7) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning Let's boost the model's intelligence! currently under construction:)
21Updated this week

Alternatives and similar repositories for RWKV-LM-RLHF:

Users that are interested in RWKV-LM-RLHF are comparing it to the libraries listed below