jackaduma / Alpaca-LoRA-RLHF-PyTorch

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca
β˜†58Updated last year

Alternatives and similar repositories for Alpaca-LoRA-RLHF-PyTorch:

Users that are interested in Alpaca-LoRA-RLHF-PyTorch are comparing it to the libraries listed below