justinjohn0306 / ControllableTalkNet
This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference.
☆45Updated last year
Alternatives and similar repositories for ControllableTalkNet:
Users that are interested in ControllableTalkNet are comparing it to the libraries listed below
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- NVIDIA's TalkNET - Train on colab☆38Updated 2 years ago
- A web app that lets you play around with TalkNet models☆118Updated last year
- NVIDIA's TalkNET - Train and Synthesize on colab☆14Updated 5 months ago
- A Gradio setup for Tortoise TTS.☆45Updated last year
- Full GUI Version☆31Updated last year
- A utility that downloads your Stable Diffusion images from discord and lets you preview them with Streamlit☆15Updated 2 years ago
- Audio datasets, easier.☆84Updated last year
- A colab notebook for video super resolution using GFPGAN☆35Updated last year
- EbSynth is hard to use... Lot's of turning videos into image sequences, resizing style images to fit the original frames, renaming the st…☆40Updated last year
- Windows compatible code for the paper "Jukebox: A Generative Model for Music"☆13Updated 2 years ago
- Personal GPEN scripts within the GPEN-Windows stand-alone package.☆20Updated 2 years ago
- NeMo: a toolkit for conversational AI☆10Updated 2 years ago
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)☆27Updated 3 years ago
- AI video temporal coherence Lab☆56Updated 2 years ago
- Colab notebooks to run a basic AI Dungeon clone using gpt-neo-2.7B☆64Updated 3 years ago
- jupyter/colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fix☆38Updated 2 years ago
- Tools to train a generative model on arbitrary audio samples☆62Updated 2 years ago
- ☆147Updated last year
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆20Updated last year
- Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation☆22Updated 2 years ago
- Text to Video☆26Updated 2 years ago
- ☆88Updated last year
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆53Updated 3 years ago
- a fork implementation of SIGGRAPH 2020 paper Interactive Video Stylization Using Few-Shot Patch-Based Training☆106Updated 2 years ago
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch☆52Updated 2 years ago
- Doohickey is a stable diffusion tool for technical artists who want to stay up-to-date with the latest developments in the field.☆39Updated 2 years ago
- A Cog implementation of the Real-ESRGAN super-resolution model from ruDALL-E.☆32Updated 2 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Updated 2 years ago
- Stable Diffusion web UI☆19Updated 2 years ago