justinjohn0306 / ControllableTalkNetLinks

This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference.

☆45

Alternatives and similar repositories for ControllableTalkNet

Users that are interested in ControllableTalkNet are comparing it to the libraries listed below

Sorting:

SortAnon / ControllableTalkNet
A web app that lets you play around with TalkNet models
☆122Updated 2 years ago
bycloudai / TalkNET-colab
NVIDIA's TalkNET - Train on colab
☆37Updated 2 years ago
ttuleyb / TortoiseTTS-GUI
GradioUI for TortoiseTTS voice generation
☆34Updated last year
Pranjalya / tts-tortoise-gradio
A Gradio setup for Tortoise TTS.
☆45Updated 2 years ago
devilismyfriend / ozen-toolkit
Audio datasets, easier.
☆84Updated 2 years ago
justinjohn0306 / TalkNET-colab
NVIDIA's TalkNET - Train and Synthesize on colab
☆14Updated 9 months ago
andreae293 / Dreambooth-Stable-Diffusion-cpu
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
☆13Updated 2 years ago
LumenPallidium / neural-file-sorter
A neural network based file sorter. Trains an autoencoder to sort images or audio based on the similarity of their encodings, or uses the…
☆29Updated 2 years ago
camenduru / one-shot-talking-face-colab
☆149Updated 2 years ago
jeremyssocial / EzEb
EbSynth is hard to use... Lot's of turning videos into image sequences, resizing style images to fit the original frames, renaming the st…
☆40Updated last year
richservo / StableDiffusionGUI
☆51Updated 2 years ago
zenforic / jukebox-win-local
Windows compatible code for the paper "Jukebox: A Generative Model for Music"
☆13Updated 2 years ago
bycloudai / PCAVS-Windows
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
☆27Updated 3 years ago
sadnow / AnimationKit-AI
AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE
☆119Updated 3 years ago
SortAnon / NeMo
NeMo: a toolkit for conversational AI
☆11Updated 2 years ago
AmericanPresidentJimmyCarter / stable-diffusion
A latent text-to-image diffusion model
☆67Updated 2 years ago
changjonathanc / anim_e
Anim·E, Anime Enhanced dalle mini
☆42Updated 2 years ago
pollinations / dance-diffusion
Tools to train a generative model on arbitrary audio samples
☆62Updated 2 years ago
newsbubbles / sdutils
Stable Diffusion Video to Video, Image to Image, Template Prompt Generation system and more, for use with any stable diffusion model
☆23Updated 2 years ago
GeeveGeorge / GFPGAN-for-Video-SR
A colab notebook for video super resolution using GFPGAN
☆36Updated 2 years ago
vincefav / stable-diffusion-lite
☆19Updated 2 years ago
JarodMica / audiosplitter_whisper
☆101Updated last year
noicevice / awesome-voice-cloning
☆63Updated 4 years ago
youmebangbang / TTS-dataset-tools
Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…
☆52Updated 3 years ago
vzakharov / jukebox-webui
Google Colab-backed Web UI for creating music with OpenAI Jukebox
☆84Updated last year
GeeveGeorge / Stable-Craiyon
A colab notebook that combines Stable Diffusion + DALL-E Mini (Craiyon)
☆126Updated 2 years ago
maua-maua-maua / maua
Deep learning toolkit for image, video, and audio synthesis
☆108Updated 2 years ago
finetunej / gpt-neo_dungeon
Colab notebooks to run a basic AI Dungeon clone using gpt-neo-2.7B
☆61Updated 4 years ago
multimodalart / mindseye
MindsEye beta - ai art pilot
☆81Updated 3 years ago
justinjohn0306 / FakeYou-Tacotron2-Notebook
Tacotron2 Training Notebook for FakeYou.com
☆165Updated 2 months ago