justinjohn0306 / TalkNET-colab
NVIDIA's TalkNET - Train and Synthesize on colab
☆14 Updated 4 months ago
Alternatives and similar repositories for TalkNET-colab:
Users who are interested in TalkNET-colab are comparing it to the repositories listed below:
- NeMo: a toolkit for conversational AI ☆10 Updated 2 years ago
- This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference. ☆45 Updated last year
- A web app that lets you play around with TalkNet models ☆118 Updated last year
- NVIDIA's TalkNET - Train on colab ☆38 Updated 2 years ago
- OpenAI MuseNet API Colab Notebook ☆32 Updated 2 years ago
- Colaboratory Notebook for Ultimate Vocal Remover ☆92 Updated 8 months ago
- AudioSR-Colab-Fork ☆39 Updated 3 months ago
- Tools to train a generative model on arbitrary audio samples ☆62 Updated 2 years ago
- GUI for Music-Source-Separation-Training ☆11 Updated 2 weeks ago
- A Python library and CLI for generating audio samples using Harmonai Dance Diffusion models. ☆94 Updated last year
- Trainer for audio-diffusion-pytorch ☆128 Updated 2 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 Updated 2 years ago
- ☆64 Updated 4 years ago
- Animated music videos with beautiful audio-reactive visual effects. ☆34 Updated 2 years ago
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021) ☆27 Updated 3 years ago
- Audio bandwidth enhancement with DNNs, addressing filter overfitting ☆40 Updated last year
- Tools to manipulate audio with riffusion ☆93 Updated last year
- Windows compatible code for the paper "Jukebox: A Generative Model for Music" ☆13 Updated 2 years ago
- GPT3-based Multi-Instrumental MIDI Music AI Implementation ☆48 Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech. ☆44 Updated last year
- A two-stage U-Net for high-fidelity denoising of historical recordings ☆102 Updated 4 months ago
- ☆11 Updated 3 weeks ago
- ☆78 Updated last year
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp… ☆57 Updated 6 years ago
- Community framework for training tortoise ☆41 Updated 2 years ago
- ☆64 Updated 11 months ago
- EbSynth is hard to use... Lots of turning videos into image sequences, resizing style images to fit the original frames, renaming the st… ☆40 Updated last year
- Repo for structured dreaming ☆55 Updated 2 years ago
- ☆83 Updated 2 years ago
- Flexible LoRA Implementation to use with stable-audio-tools ☆66 Updated 6 months ago