tlc4418 / llm_optimization

A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.
31Updated 8 months ago

Related projects

Alternatives and complementary repositories for llm_optimization