facebookresearch / speech_translation
Demo and samples for universal speech translator
☆23Updated 2 years ago
Alternatives and similar repositories for speech_translation
Users who are interested in speech_translation are comparing it to the libraries listed below
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 7 years ago
- ☆12Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- ASR project with pytorch-lightning☆20Updated last month
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆14Updated 3 years ago
- The History of Speech Recognition to the Year 2030☆13Updated 3 years ago
- A spoken question answering dataset on SQUAD☆47Updated 2 weeks ago
- Unsupervised spoken sentence embeddings☆14Updated 2 years ago
- ☆11Updated 3 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 4 years ago
- (Si)mply a (Re)search front-end for Text-To-Speech Synthesis.☆10Updated 7 years ago
- Tacotron2 with BERT examples☆10Updated 5 years ago
- An open-source tool for automatic speech recognition (ASR) quality estimation.☆23Updated 5 years ago
- ☆76Updated 3 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated 2 years ago
- Curriculum Vitae of Quan Wang☆15Updated 4 months ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- NMT-based punctuation prediction system using lexical and acoustic features.☆14Updated 5 years ago
- ☆20Updated 5 years ago
- Experiments and tutorials with and for torchaudio☆13Updated 4 years ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆43Updated 2 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 3 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 3 weeks ago
- An extensible speech synthesis system, built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Updated 5 years ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆17Updated 5 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 3 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago