xuyang-sudo / AutoRLAIF

AutoRLAIF is a cutting-edge framework designed to revolutionize the fine-tuning of large language models through Reinforcement Learning from AI Feedback (RLAIF).
117Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for AutoRLAIF