facebookresearch / speech_translation
Demo and samples for universal speech translator
☆22Updated last year
Related projects ⓘ
Alternatives and complementary repositories for speech_translation
- Proposed splits for the LREC Wikipron paper☆13Updated 4 years ago
- A spoken question answering dataset on SQUAD☆39Updated 2 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆42Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆43Updated 2 years ago
- ☆74Updated 3 years ago
- ☆9Updated last year
- NMT based punctuation prediction system using lexical and acoustic features .☆14Updated 4 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆39Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 6 months ago
- ☆20Updated 3 years ago
- Text normalization scripts from IRISA lab☆12Updated 6 years ago
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 2 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆23Updated last year
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- ASR project with pytorch-lightning☆20Updated 4 years ago
- Unsupervised spoken sentence embeddings☆14Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 3 months ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated last year
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Updated 4 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆10Updated 4 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆38Updated 4 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆24Updated 4 years ago
- ☆33Updated 3 years ago