Joyce94 / LLM-RLHF-Tuning
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
453 · Oct 11, 2023 · Updated 2 years ago

Alternatives and similar repositories for LLM-RLHF-Tuning

Users that are interested in LLM-RLHF-Tuning are comparing it to the libraries listed below

