alicank / Translation-Augmented-LibriSpeech-Corpus
Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and contains English utterances (from audiobooks) automatically aligned with French text. Our dataset offers ~236h of speech aligned to translated text.
☆43Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Translation-Augmented-LibriSpeech-Corpus
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Updated 2 years ago
- The Fisher and CALLHOME Spanish–English Speech Translation Corpus☆38Updated 2 years ago
- ☆14Updated 5 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 3 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆37Updated 4 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆37Updated last year
- End-to-end Speech Translation☆36Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- RNNs for Text Normalization☆38Updated 6 years ago
- Grapheme to phoneme model for PyTorch☆40Updated 2 years ago
- Links to data used in Sproat & Jaitly (https://arxiv.org/abs/1611.00068) experiments.☆76Updated 3 years ago
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Updated 3 years ago
- Deep Learning systems for training and testing disfluency detection and related tasks on speech data.☆57Updated 5 years ago
- ☆45Updated 5 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆55Updated 6 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 3 years ago
- Covering grammars for English and Russian text normalization☆60Updated 5 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆103Updated last year
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 4 years ago
- mWER loss implementation in tensorflow☆31Updated 4 years ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆19Updated 2 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- A spoken question answering dataset on SQUAD☆39Updated 2 years ago
- CMU multilingual speech repository☆31Updated 2 years ago
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Updated 5 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- Speech2vec pre-trained word vectors☆77Updated 6 years ago