natolambert / rlhf-book

Textbook on reinforcement learning from human feedback
22Updated last month

Related projects: