lucidrains / llama-qrlhf

Implementation of the Llama architecture with RLHF + Q-learning
163 stars · Updated last month

Alternatives and similar repositories for llama-qrlhf:

Users who are interested in llama-qrlhf are comparing it to the libraries listed below.