alicank / Translation-Augmented-LibriSpeech-CorpusLinks
Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and contains English utterances (from audiobooks) automatically aligned with French text. Our dataset offers ~236h of speech aligned to translated text.
☆44Updated 2 years ago
Alternatives and similar repositories for Translation-Augmented-LibriSpeech-Corpus
Users that are interested in Translation-Augmented-LibriSpeech-Corpus are comparing it to the libraries listed below
Sorting:
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Updated 3 years ago
- The Fisher and CALLHOME Spanish–English Speech Translation Corpus☆40Updated 3 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 2 months ago
- ☆14Updated 6 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 4 years ago
- Deep Learning systems for training and testing disfluency detection and related tasks on speech data.☆58Updated 6 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 2 years ago
- A spoken question answering dataset on SQUAD☆49Updated last month
- Covering grammars for English and Russian text normalization☆61Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆38Updated 2 years ago
- Recurrent Neural Aligner☆50Updated 5 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 11 months ago
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- End-to-end Speech Translation☆36Updated 4 years ago
- RNNs for Text Normalization☆39Updated 7 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆38Updated 5 years ago
- ☆24Updated 5 years ago
- Multilingual speech translation☆41Updated 4 years ago
- CMU multilingual speech repository☆31Updated 3 years ago
- A phoneme-allophone database for many languages☆52Updated 5 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 4 years ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆20Updated 3 years ago
- Spoken Language Translation System☆20Updated 3 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- ☆16Updated 3 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Updated 3 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Updated 5 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Updated 7 years ago