lokkelvin2 / dc_tts_GUI
GUI Wrapper for 'A TensorFlow Implementation of DC-TTS: yet another text-to-speech model'
★25 · Updated 5 years ago
Alternatives and similar repositories for dc_tts_GUI
Users who are interested in dc_tts_GUI are comparing it to the libraries listed below.
- 🦆💰 A bot that uses Uberduck (and FakeYou) AI to make bit donations have an AI voice. ★82 · Updated last year
- ★62 · Updated 4 years ago
- Deep Learning technology to upscale music. ★23 · Updated 5 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ★89 · Updated 4 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp… ★57 · Updated 6 years ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion ★146 · Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io ★15 · Updated last year
- Open Source Text-to-Speech GUI Tool running on TalkNet ★11 · Updated 2 years ago
- Fetches chat for a given Twitch video and creates a video replay for highlight editors. ★45 · Updated 2 years ago
- Your one-stop solution for voice dataset creation ★125 · Updated last year
- Global Rhythm Style Transfer Without Text Transcriptions ★284 · Updated 11 months ago
- [WIP] VoiceSmith makes training text to speech models easy. ★226 · Updated 3 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit… ★169 · Updated 5 years ago
- Collect Voice Conversion researches ★94 · Updated this week
- ★274 · Updated last year
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow ★129 · Updated 4 years ago
- Official Implementation of StyleTTS-VC ★191 · Updated 8 months ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ★84 · Updated 2 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription. ★34 · Updated 7 months ago
- ★130 · Updated 2 years ago
- Emotional Speech Conversion using Style Transfer and MUNIT ★36 · Updated 6 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network ★321 · Updated last year
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo! ★354 · Updated 3 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p… ★52 · Updated 3 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation ★132 · Updated 2 years ago
- A pytorch implementation of StarGAN-VC2 ★149 · Updated 5 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No… ★115 · Updated 4 years ago
- Heteronym to Phoneme Parser ★18 · Updated last year
- Obama singing any song. Check ReadMe ★10 · Updated 6 years ago
- ★11 · Updated last year