SortAnon / ControllableTalkNet
A web app that lets you play around with TalkNet models
☆124 Updated 2 years ago
Alternatives and similar repositories for ControllableTalkNet
Users who are interested in ControllableTalkNet are comparing it to the libraries listed below.
- GradioUI for TortoiseTTS voice generation ☆34 Updated 2 years ago
- Audio datasets, easier. ☆85 Updated 2 years ago
- NVIDIA's TalkNET - Train on colab ☆37 Updated 2 years ago
- NeMo: a toolkit for conversational AI ☆11 Updated 2 years ago
- TorToiSe fine-tuning with DLAS ☆227 Updated last year
- A latent text-to-image diffusion model ☆67 Updated 3 years ago
- A Gradio setup for Tortoise TTS. ☆45 Updated 2 years ago
- Full GUI Version ☆31 Updated 2 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 Updated 2 years ago
- Tacotron2 Training Notebook for FakeYou.com ☆164 Updated 2 months ago
- RVC Inference with multiple model and huggingface support ☆110 Updated last month
- ☆100 Updated last year
- ☆148 Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆123 Updated 5 months ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices ☆20 Updated 2 years ago
- DLAS - A configuration-driven trainer for generative models ☆143 Updated 3 years ago
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆146 Updated last year
- Unofficial GUI implementation of Simswap ☆36 Updated 2 years ago
- ☆159 Updated 2 years ago
- a fork implementation of SIGGRAPH 2020 paper Interactive Video Stylization Using Few-Shot Patch-Based Training ☆106 Updated 3 years ago
- badly coded gui for a quick streamlined workflow to produce 512x512 images suitable to train Stable Diffusion ☆31 Updated 3 years ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers. ☆69 Updated 6 months ago
- ☆91 Updated 3 years ago
- Community framework for training tortoise ☆44 Updated 3 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ☆89 Updated 4 years ago
- Faster Tortoise inference than Tortoise Fast Fork ☆128 Updated last year
- Riffusion extension for AUTOMATIC1111's SD Web UI ☆203 Updated 2 years ago
- A multi-voice TTS system trained with an emphasis on quality ☆109 Updated 3 years ago
- Personal GPEN scripts within the GPEN-Windows stand-alone package. ☆20 Updated 3 years ago
- ☆63 Updated 4 years ago