justinjohn0306 / TalkNET-colab
NVIDIA's TalkNET - Train and Synthesize on colab
☆14Updated 5 months ago
Alternatives and similar repositories for TalkNET-colab:
Users that are interested in TalkNET-colab are comparing it to the libraries listed below
- NeMo: a toolkit for conversational AI☆10Updated 2 years ago
- This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference.☆45Updated last year
- NVIDIA's TalkNET - Train on colab☆38Updated 2 years ago
- OpenAI MuseNet API Colab Notebook☆32Updated 2 years ago
- GPT3-based Multi-Instrumental MIDI Music AI Implementation☆48Updated 2 years ago
- A Python library and CLI for generating audio samples using Harmonai Dance Diffusion models.☆94Updated last year
- tools to manipulate audio with riffusion☆93Updated last year
- Colaboratory Notebook for Ultimate Vocal Remover☆93Updated 8 months ago
- A web app that lets you play around with TalkNet models☆118Updated last year
- Trainer for audio-diffusion-pytorch☆129Updated 2 years ago
- Audio bandwidth enhancement with DNNs, addressing filter overfitting☆40Updated last year
- Tools to train a generative model on arbitrary audio samples☆62Updated 2 years ago
- Animated music videos with beautiful audio-reactive visual effects.☆34Updated 2 years ago
- [Exclusive for GitHub] deep-muse: Advanced Text-to-Music Generator Implementation☆16Updated 3 years ago
- GUI toolkit using various audio diffusion repos.☆74Updated last year
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆78Updated 4 years ago
- Audio generation using diffusion models, in PyTorch.☆47Updated last year
- AudioSR-Colab-Fork☆41Updated 3 months ago
- fine-tuning MusicGen without prompts to generate music with a specific style☆62Updated last year
- Google Colab-backed Web UI for creating music with OpenAI Jukebox☆84Updated last year
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 3 years ago
- ☆21Updated 2 years ago
- ☆14Updated 3 years ago
- ☆12Updated last month
- Community framework for training tortoise☆41Updated 2 years ago
- Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.☆119Updated 3 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆53Updated 3 years ago
- Text prompt steered synthetic audio generators☆46Updated last year
- A fork of sinsy: HMM/DNN-based singing voice synthesis system☆66Updated 3 years ago
- ☆64Updated 4 years ago