sam2125 / translatotron
☆42Updated 2 years ago
Related projects: ⓘ
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆56Updated last year
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆86Updated 2 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆63Updated 2 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆178Updated 2 years ago
- This is Pytorch Implementation of Google's Non-attentive Tacotron.☆57Updated last year
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023☆194Updated last year
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆78Updated last year
- ☆160Updated 2 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆63Updated 2 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆188Updated 2 years ago
- Unofficial Pytorch Implementation of WaveGrad2☆111Updated 3 years ago
- Collection of pretrained models for the Montreal Forced Aligner☆108Updated 2 months ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆144Updated 2 years ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆156Updated 3 years ago
- Example code for a neural transducer model.☆58Updated 7 months ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆276Updated 3 years ago
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search☆80Updated 3 years ago
- ☆57Updated 2 weeks ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆109Updated 2 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆166Updated last year
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆105Updated 2 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆106Updated 3 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆82Updated last year
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 3 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆64Updated 11 months ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆33Updated 2 years ago
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"☆131Updated last year
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆92Updated last year
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆181Updated last year
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆69Updated 3 years ago