sanjeevanahilan / nanoChatGPT

A crude RLHF layer on top of nanoGPT, using the Gumbel-Softmax trick
287 · Updated 11 months ago

Related projects

Alternatives and complementary repositories for nanoChatGPT