tlc4418 / llm_optimization

A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.
35Updated 2 weeks ago

Alternatives and similar repositories for llm_optimization:

Users that are interested in llm_optimization are comparing it to the libraries listed below