sam2125 / translatotron
☆42 · Updated 2 years ago
Alternatives and similar repositories for translatotron:
Users that are interested in translatotron are comparing it to the libraries listed below
- ☆163 · Updated 2 years ago
- Transformer implementation specialized in speech recognition tasks using Pytorch. ☆65 · Updated 3 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus ☆193 · Updated 2 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis. ☆87 · Updated 2 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling ☆190 · Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆50 · Updated 2 years ago
- This is Pytorch Implementation of Google's Non-attentive Tacotron. ☆57 · Updated 2 years ago
- A PyTorch implementation of the universal neural vocoder ☆67 · Updated 4 years ago
- ☆111 · Updated 2 years ago
- PyTorch implementation of RNN-Transducer(RNN-T). ☆74 · Updated 3 years ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ☆94 · Updated 2 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future. ☆154 · Updated 3 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl… ☆159 · Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021 ☆57 · Updated 2 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆60 · Updated 2 years ago
- Unofficial Pytorch Implementation of WaveGrad2 ☆111 · Updated 3 years ago
- ☆65 · Updated last month
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 ☆208 · Updated last year
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g… ☆146 · Updated 2 years ago
- Predicts the level of noise and reverberation on your audiofiles ☆143 · Updated 7 months ago
- A 16kHz implementation of HiFi-GAN for soft-vc. ☆95 · Updated last year
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech ☆105 · Updated 2 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis ☆113 · Updated 4 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se… ☆83 · Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ☆226 · Updated this week
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training ☆132 · Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆46 · Updated 6 months ago
- Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889) ☆268 · Updated 3 years ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform ☆233 · Updated last year
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data" ☆133 · Updated last year