ryanrudes / YTTTS

The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
50Updated 3 years ago

Related projects: