youmebangbang / deepvoice-model-utilities
☆21 Updated 2 years ago
Alternatives and similar repositories for deepvoice-model-utilities
Users who are interested in deepvoice-model-utilities are comparing it to the libraries listed below.
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 Updated 2 years ago
- GradioUI for TortoiseTTS voice generation ☆34 Updated last year
- A web app that lets you play around with TalkNet models ☆121 Updated 2 years ago
- NeMo: a toolkit for conversational AI ☆11 Updated 2 years ago
- ☆107 Updated last year
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart) ☆25 Updated 3 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently. ☆196 Updated 2 years ago
- Deep learning toolkit for image, video, and audio synthesis ☆108 Updated 2 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p… ☆52 Updated 3 years ago
- ☆52 Updated 3 years ago
- Tools to train a generative model on arbitrary audio samples ☆62 Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch. ☆114 Updated 2 years ago
- Trainer for audio-diffusion-pytorch ☆129 Updated 2 years ago
- NVIDIA's TalkNET - Train on colab ☆37 Updated 2 years ago
- Windows compatible code for the paper "Jukebox: A Generative Model for Music" ☆13 Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆32 Updated 2 years ago
- text-to-audio-latent-diffusion ☆37 Updated last year
- ☆83 Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io ☆16 Updated last year
- openai guided diffusion tweaks ☆52 Updated 2 years ago
- This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference. ☆45 Updated 2 years ago
- Repo for structured dreaming ☆55 Updated 3 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆122 Updated last month
- Guided diffusion ☆11 Updated 3 years ago
- Code for "Jukebox: A Generative Model for Music" ☆19 Updated 5 years ago
- tools to manipulate audio with riffusion ☆96 Updated last year
- Discord AI Generation Bot to collect an aesthetic rating dataset ☆60 Updated 2 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ☆89 Updated 4 years ago
- Voice swapping with VQ-VAE and diffusion models ☆67 Updated 3 years ago
- Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source t… ☆68 Updated 3 years ago