SortAnon / ControllableTalkNet
A web app that lets you play around with TalkNet models
☆119 · Updated last year
Alternatives and similar repositories for ControllableTalkNet:
Users who are interested in ControllableTalkNet are comparing it to the libraries listed below.
- NVIDIA's TalkNET - Train on colab ☆38 · Updated last year
- GradioUI for TortoiseTTS voice generation ☆34 · Updated last year
- Full GUI Version ☆30 · Updated last year
- This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference. ☆45 · Updated last year
- Audio datasets, easier. ☆82 · Updated last year
- A Gradio setup for Tortoise TTS. ☆45 · Updated last year
- [CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting ☆50 · Updated 2 years ago
- A latent text-to-image diffusion model ☆67 · Updated 2 years ago
- Faster Tortoise inference than Tortoise Fast Fork ☆128 · Updated 9 months ago
- Using RVC via console or python scripts ☆109 · Updated 3 months ago
- ☆147 · Updated last year
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 · Updated 2 years ago
- ☆158 · Updated 2 years ago
- TorToiSe fine-tuning with DLAS ☆218 · Updated 6 months ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ☆88 · Updated 3 years ago
- AudioSR-Colab-Fork ☆38 · Updated last month
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021) ☆27 · Updated 3 years ago
- EbSynth is hard to use... Lots of turning videos into image sequences, resizing style images to fit the original frames, renaming the st… ☆40 · Updated last year
- a fork implementation of SIGGRAPH 2020 paper Interactive Video Stylization Using Few-Shot Patch-Based Training ☆106 · Updated 2 years ago
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult… ☆56 · Updated 4 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆118 · Updated 11 months ago
- ☆99 · Updated 6 months ago
- [WIP] VoiceSmith makes training text to speech models easy. ☆224 · Updated 2 years ago
- Tacotron2 Training Notebook for FakeYou.com ☆163 · Updated 2 months ago
- Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation ☆23 · Updated 2 years ago
- ☆88 · Updated 10 months ago
- ☆90 · Updated 2 years ago
- Applies mirroring and flips to the latent images to produce anything from subtle balanced compositions to perfect reflections ☆114 · Updated last year
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆142 · Updated last year
- RVC Inference with multiple model and huggingface support ☆102 · Updated 11 months ago