SortAnon / ControllableTalkNet
A web app that lets you play around with TalkNet models
☆121 Updated 2 years ago
Alternatives and similar repositories for ControllableTalkNet
Users who are interested in ControllableTalkNet are comparing it to the libraries listed below.
- Audio datasets, easier. ☆84 Updated last year
- NeMo: a toolkit for conversational AI ☆11 Updated 2 years ago
- GradioUI for TortoiseTTS voice generation ☆34 Updated last year
- TorToiSe fine-tuning with DLAS ☆225 Updated last year
- A Gradio setup for Tortoise TTS. ☆45 Updated 2 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 Updated 2 years ago
- A latent text-to-image diffusion model ☆67 Updated 2 years ago
- This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference. ☆45 Updated 2 years ago
- Full GUI Version ☆31 Updated 2 years ago
- ☆149 Updated 2 years ago
- NVIDIA's TalkNET - Train on colab ☆38 Updated 2 years ago
- RVC Inference with multiple model and huggingface support ☆105 Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆122 Updated last month
- ☆160 Updated 2 years ago
- Tacotron2 Training Notebook for FakeYou.com ☆165 Updated last month
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆145 Updated last year
- A forked implementation of the SIGGRAPH 2020 paper Interactive Video Stylization Using Few-Shot Patch-Based Training ☆106 Updated 2 years ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers. ☆68 Updated 2 months ago
- Riffusion extension for AUTOMATIC1111's SD Web UI ☆200 Updated 2 years ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices ☆20 Updated 2 years ago
- ☆101 Updated 11 months ago
- DLAS - A configuration-driven trainer for generative models ☆140 Updated 2 years ago
- Generate morph sequences with Stable Diffusion. Interpolate between two or more prompts and create an image at each step. ☆118 Updated last year
- EbSynth is hard to use... Lots of turning videos into image sequences, resizing style images to fit the original frames, renaming the st… ☆40 Updated last year
- Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation ☆23 Updated 2 years ago
- AUTOMATIC1111 webUI + Krita Plugin with superb Inpainting ☆87 Updated 2 years ago
- MMD2depth: use a MikuMikuDance model in Stable Diffusion 2.0 depth2img ☆29 Updated 2 years ago
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021) ☆27 Updated 3 years ago
- A multi-voice TTS system trained with an emphasis on quality ☆109 Updated 3 years ago
- Converts Hugging Face diffusers Stable Diffusion models to Stable Diffusion ckpt files usable in most open-source tools ☆53 Updated 2 years ago