SortAnon / ControllableTalkNet
A web app that lets you play around with TalkNet models
☆118Updated last year
Alternatives and similar repositories for ControllableTalkNet:
Users that are interested in ControllableTalkNet are comparing it to the libraries listed below
- NVIDIA's TalkNET - Train on colab☆38Updated 2 years ago
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- Full GUI Version☆31Updated last year
- [WIP] VoiceSmith makes training text to speech models easy.☆224Updated 2 years ago
- This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference.☆44Updated last year
- A Gradio setup for Tortoise TTS.☆45Updated last year
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Updated 2 years ago
- Audio datasets, easier.☆82Updated last year
- ☆147Updated last year
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 3 years ago
- A latent text-to-image diffusion model☆67Updated 2 years ago
- TorToiSe fine-tuning with DLAS☆218Updated 7 months ago
- AUTOMATIC1111 webUI + Krita Plugin with superb Inpainting☆88Updated 2 years ago
- NeMo: a toolkit for conversational AI☆10Updated 2 years ago
- TTS pipeline that uses RVC to enhance audio quality and cloning☆144Updated last year
- Riffusion extension for AUTOMATIC1111's SD Web UI☆201Updated last year
- Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation☆23Updated 2 years ago
- ☆98Updated 7 months ago
- a fork implementation of SIGGRAPH 2020 paper Interactive Video Stylization Using Few-Shot Patch-Based Training☆106Updated 2 years ago
- MMD2depth use MikuMikuDance model in Stable Diffusion 2.0 depth2img☆29Updated 2 years ago
- converts huggingface diffusers stablediffussion models to stablediffusion ckpt files usable in most opensource tools☆53Updated last year
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project☆33Updated last year
- ☆186Updated last year
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated 11 months ago
- ☆53Updated 2 years ago
- RVC Inference with multiple model and huggingface support☆103Updated last year
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆20Updated last year
- [CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting☆50Updated 2 years ago
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch☆52Updated 2 years ago
- NVIDIA's TalkNET - Train and Synthesize on colab☆14Updated 4 months ago