sam2125 / translatotron
☆42Updated 3 years ago
Alternatives and similar repositories for translatotron
Users that are interested in translatotron are comparing it to the libraries listed below
Sorting:
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- ☆163Updated 2 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023☆216Updated 2 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆301Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆194Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- This is Pytorch Implementation of Google's Non-attentive Tacotron.☆58Updated 2 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆83Updated 2 years ago
- Example code for a neural transducer model.☆61Updated last year
- ☆43Updated 2 years ago
- ☆112Updated 3 years ago
- GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification☆13Updated 5 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆64Updated 3 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated last year
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 2 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆114Updated 4 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆63Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆250Updated 4 months ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Updated 3 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆47Updated 2 years ago
- ☆25Updated 2 years ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Updated 2 years ago
- An awesome spoken LID repository. (Working in progress☆102Updated last year
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- Predicts the level of noise and reverberation on your audiofiles☆149Updated 11 months ago
- A sequence-to-sequence voice conversion toolkit.☆97Updated 10 months ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆157Updated 3 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆193Updated 2 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆154Updated 3 years ago