sam2125 / translatotronLinks
☆44Updated 3 years ago
Alternatives and similar repositories for translatotron
Users that are interested in translatotron are comparing it to the libraries listed below
Sorting:
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- ☆163Updated 2 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- ☆67Updated 2 weeks ago
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"☆135Updated last year
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆96Updated 2 years ago
- This is Pytorch Implementation of Google's Non-attentive Tacotron.☆57Updated 2 years ago
- Example code for a neural transducer model.☆62Updated last year
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆47Updated 2 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆84Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training☆134Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆52Updated 2 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆74Updated 3 years ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆138Updated 8 months ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆110Updated 3 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆64Updated 2 years ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated last year
- Alignment files of LibriTTS.☆62Updated 5 years ago
- ☆111Updated 3 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆52Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆77Updated last year
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆116Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆52Updated last month
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.☆81Updated 2 years ago
- Clustering-based methods for overlapping diarization☆80Updated last year
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆201Updated 2 years ago
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆110Updated last year
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆190Updated 3 years ago