alicank / Translation-Augmented-LibriSpeech-Corpus

Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and contains English utterances (from audiobooks) automatically aligned with French text. Our dataset offers ~236h of speech aligned to translated text.
43Updated 2 years ago

Alternatives and similar repositories for Translation-Augmented-LibriSpeech-Corpus:

Users that are interested in Translation-Augmented-LibriSpeech-Corpus are comparing it to the libraries listed below