youmebangbang / deepvoice-model-utilitiesLinks
☆20Updated 2 years ago
Alternatives and similar repositories for deepvoice-model-utilities
Users that are interested in deepvoice-model-utilities are comparing it to the libraries listed below
Sorting:
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 3 years ago
- GradioUI for TortoiseTTS voice generation☆34Updated 2 years ago
- A web app that lets you play around with TalkNet models☆124Updated 2 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Updated 2 years ago
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- NeMo: a toolkit for conversational AI☆11Updated 2 years ago
- ☆52Updated 3 years ago
- ☆83Updated 3 years ago
- Deep learning toolkit for image, video, and audio synthesis☆108Updated 2 years ago
- NVIDIA's TalkNET - Train on colab☆37Updated 2 years ago
- Guided diffusion☆11Updated 3 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆89Updated 4 years ago
- Discord AI Generation Bot to collect an aesthetic rating dataset☆58Updated 3 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆195Updated 2 years ago
- Doohickey is a stable diffusion tool for technical artists who want to stay up-to-date with the latest developments in the field.☆40Updated 3 years ago
- openai guided diffusion tweaks☆52Updated 3 years ago
- Windows compatible code for the paper "Jukebox: A Generative Model for Music"☆13Updated 3 years ago
- A notebook for text-based guided image generation using StyleGANXL and CLIP.☆58Updated 2 years ago
- Trainer for audio-diffusion-pytorch☆130Updated 2 years ago
- Tools to isolate speaker and transcribe unstructured audio clips☆11Updated 3 years ago
- Audio datasets, easier.☆86Updated 2 years ago
- A latent text-to-image diffusion model☆67Updated 3 years ago
- CLIP and PASTE: Using AI to Create Photo Collages from Text Prompts☆29Updated 3 years ago
- High-Resolution Image Synthesis with Latent Diffusion Models☆60Updated 3 years ago
- Anim·E, Anime Enhanced dalle mini☆40Updated 3 years ago
- Voice swapping with VQ-VAE and diffusion models☆68Updated 4 years ago
- tools to manipulate audio with riffusion☆95Updated 2 years ago
- jupyter/colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fix☆38Updated 3 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 3 years ago
- AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE☆119Updated 3 years ago