alicank / Translation-Augmented-LibriSpeech-Corpus

A large-scale (>200 h), publicly available corpus of read audiobook speech. It augments the LibriSpeech ASR Corpus (1000 h) and contains English utterances from audiobooks, automatically aligned with French text. Our dataset offers ~236 h of speech aligned to translated text.
43 · Updated 2 years ago

Related projects

Alternatives and complementary repositories for Translation-Augmented-LibriSpeech-Corpus