OpenMOSE / RWKV-LM-RLHF
Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the possibilities for deeper fine-tuning of RWKV.
☆40Updated last week
Alternatives and similar repositories for RWKV-LM-RLHF
Users that are interested in RWKV-LM-RLHF are comparing it to the libraries listed below
Sorting:
- ☆121Updated 3 weeks ago
- This project is established for real-time training of the RWKV model.