youmebangbang / deepvoice-model-utilities
☆20 Updated 2 years ago
Alternatives and similar repositories for deepvoice-model-utilities
Users who are interested in deepvoice-model-utilities are comparing it to the libraries listed below.
- A web app that lets you play around with TalkNet models ☆122 Updated 2 years ago
- GradioUI for TortoiseTTS voice generation ☆34 Updated last year
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 Updated 2 years ago
- NeMo: a toolkit for conversational AI ☆11 Updated 2 years ago
- Windows compatible code for the paper "Jukebox: A Generative Model for Music" ☆13 Updated 2 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart) ☆25 Updated 3 years ago
- Tools to isolate speaker and transcribe unstructured audio clips ☆11 Updated 2 years ago
- Deep learning toolkit for image, video, and audio synthesis ☆108 Updated 2 years ago
- Tools to train a generative model on arbitrary audio samples ☆62 Updated 2 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently. ☆196 Updated 2 years ago
- Trainer for audio-diffusion-pytorch ☆129 Updated 2 years ago
- ☆107 Updated last year
- AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE ☆119 Updated 3 years ago
- tools to manipulate audio with riffusion ☆96 Updated last year
- Traditional deepdream with VQGAN+CLIP and optical flow. Ready to use in Google Colab. ☆23 Updated 3 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental! ☆43 Updated 2 years ago
- ☆52 Updated 3 years ago
- A notebook for text-based guided image generation using StyleGANXL and CLIP. ☆59 Updated 2 years ago
- text-to-audio-latent-diffusion ☆37 Updated 2 years ago
- ☆20 Updated 4 years ago
- DLAS - A configuration-driven trainer for generative models ☆139 Updated 2 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ☆89 Updated 4 years ago
- Repo for structured dreaming ☆55 Updated 3 years ago
- A colab notebook for video super resolution using GFPGAN ☆36 Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆32 Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io ☆16 Updated last year
- Tools for smoothly interpolating between prompts for Stable Diffusion models ☆59 Updated 3 years ago
- jupyter/colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fix ☆38 Updated 3 years ago
- High-Resolution Image Synthesis with Latent Diffusion Models ☆61 Updated 3 years ago
- Community framework for training tortoise ☆43 Updated 2 years ago