justinjohn0306 / ControllableTalkNet
This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference.
☆45 Updated last year
Alternatives and similar repositories for ControllableTalkNet
Users who are interested in ControllableTalkNet are comparing it to the repositories listed below.
- A web app that lets you play around with TalkNet models ☆121 Updated last year
- NVIDIA's TalkNET - Train on colab ☆38 Updated 2 years ago
- GradioUI for TortoiseTTS voice generation ☆34 Updated last year
- NVIDIA's TalkNET - Train and Synthesize on colab ☆14 Updated 8 months ago
- A latent text-to-image diffusion model ☆67 Updated 2 years ago
- A Gradio setup for Tortoise TTS. ☆45 Updated 2 years ago
- Tacotron2 Training Notebook for FakeYou.com ☆164 Updated 3 weeks ago
- A colab notebook for video super resolution using GFPGAN ☆36 Updated 2 years ago
- Audio datasets, easier. ☆84 Updated last year
- ☆149 Updated 2 years ago
- ☆51 Updated 2 years ago
- EbSynth is hard to use... Lots of turning videos into image sequences, resizing style images to fit the original frames, renaming the st… ☆40 Updated last year
- Anim·E, Anime Enhanced dalle mini ☆42 Updated 2 years ago
- a fork implementation of SIGGRAPH 2020 paper Interactive Video Stylization Using Few-Shot Patch-Based Training ☆106 Updated 2 years ago
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021) ☆27 Updated 3 years ago
- A notebook for text-based guided image generation using StyleGANXL and CLIP. ☆59 Updated 2 years ago
- Full GUI Version ☆31 Updated 2 years ago
- Google Colab-backed Web UI for creating music with OpenAI Jukebox ☆84 Updated last year
- Colab notebooks to run a basic AI Dungeon clone using gpt-neo-2.7B ☆61 Updated 3 years ago
- Tools to train a generative model on arbitrary audio samples ☆62 Updated 2 years ago
- Yet Another Stable Diffusion Discord Bot ☆112 Updated 2 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p… ☆52 Updated 3 years ago
- Generate morph sequences with Stable Diffusion. Interpolate between two or more prompts and create an image at each step. ☆118 Updated last year
- ☆160 Updated 2 years ago
- ☆19 Updated 2 years ago
- Personal GPEN scripts within the GPEN-Windows stand-alone package. ☆21 Updated 3 years ago
- A Cog implementation of the Real-ESRGAN super-resolution model from ruDALL-E. ☆32 Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆122 Updated last month
- Windows compatible code for the paper "Jukebox: A Generative Model for Music" ☆13 Updated 2 years ago
- ☆83 Updated 2 years ago