jackaduma / ChatGLM-LoRA-RLHF-PyTorchView on GitHub
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
140Apr 28, 2023Updated 2 years ago

Alternatives and similar repositories for ChatGLM-LoRA-RLHF-PyTorch

Users that are interested in ChatGLM-LoRA-RLHF-PyTorch are comparing it to the libraries listed below

Sorting:

Are these results useful?