mfarisadip / T5-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) and GAN (Generative Adversarial Network) on top of the T5 architecture.
15Updated 2 years ago

Alternatives and similar repositories for T5-rlhf-pytorch:

Users that are interested in T5-rlhf-pytorch are comparing it to the libraries listed below