sam2125 / translatotron
☆42Updated 3 years ago
Alternatives and similar repositories for translatotron:
Users that are interested in translatotron are comparing it to the libraries listed below
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"☆134Updated last year
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Updated 3 years ago
- This is Pytorch Implementation of Google's Non-attentive Tacotron.☆58Updated 2 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆47Updated last year
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023☆212Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆118Updated 2 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆192Updated 3 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆194Updated 2 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆142Updated last year
- ☆163Updated 2 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆154Updated 3 years ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆157Updated 3 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training☆134Updated 2 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆84Updated 2 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- A sequence-to-sequence voice conversion toolkit.☆96Updated 8 months ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆96Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 9 months ago
- ☆112Updated 2 years ago
- Unofficial Pytorch Implementation of WaveGrad2☆112Updated 3 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech☆244Updated 3 years ago
- ☆80Updated 10 months ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆298Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆132Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆62Updated 2 years ago