justinjohn0306 / ControllableTalkNet
This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference.
☆45Updated last year
Alternatives and similar repositories for ControllableTalkNet
Users that are interested in ControllableTalkNet are comparing it to the libraries listed below
Sorting:
- NVIDIA's TalkNET - Train on colab☆38Updated 2 years ago
- A web app that lets you play around with TalkNet models☆119Updated last year
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- Audio datasets, easier.☆84Updated last year
- A Gradio setup for Tortoise TTS.☆45Updated last year
- Full GUI Version☆31Updated 2 years ago
- A colab notebook for video super resolution using GFPGAN☆36Updated last year
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion☆13Updated 2 years ago
- NVIDIA's TalkNET - Train and Synthesize on colab☆14Updated 6 months ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 8 months ago
- A latent text-to-image diffusion model☆67Updated 2 years ago
- Windows compatible code for the paper "Jukebox: A Generative Model for Music"☆13Updated 2 years ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆20Updated 2 years ago
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)☆27Updated 3 years ago
- Generate morph sequences with Stable Diffusion. Interpolate between two or more prompts and create an image at each step.☆117Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch☆52Updated 2 years ago
- Personal GPEN scripts within the GPEN-Windows stand-alone package.☆20Updated 2 years ago
- k_diffusion wrapper included for k_lms sampling. fixed for notebook.☆20Updated last year
- ☆64Updated 4 years ago
- ☆88Updated last year
- Doohickey is a stable diffusion tool for technical artists who want to stay up-to-date with the latest developments in the field.☆39Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆33Updated last year
- Dreambooth for colab☆32Updated last year
- Stable Diffusion Video to Video, Image to Image, Template Prompt Generation system and more, for use with any stable diffusion model☆23Updated 2 years ago
- A GUI for text2img diffusion, as a visual alternative to CLI and Jupyter Notebooks.☆29Updated 2 years ago
- Text prompt steered synthetic audio generators☆46Updated last month
- ☆51Updated 2 years ago
- ☆28Updated last year
- jupyter/colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fix☆38Updated 2 years ago
- A neural network based file sorter. Trains an autoencoder to sort images or audio based on the similarity of their encodings, or uses the…☆30Updated last year