jackaduma / Vicuna-LoRA-RLHF-PyTorchLinks

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna
215Updated last year

Alternatives and similar repositories for Vicuna-LoRA-RLHF-PyTorch

Users that are interested in Vicuna-LoRA-RLHF-PyTorch are comparing it to the libraries listed below

Sorting: