youmebangbang / deepvoice-model-utilities
☆20 Updated 2 years ago
Alternatives and similar repositories for deepvoice-model-utilities
Users who are interested in deepvoice-model-utilities are comparing it to the libraries listed below.
- A web app that lets you play around with TalkNet models ☆123 Updated 2 years ago
- GradioUI for TortoiseTTS voice generation ☆34 Updated 2 years ago
- Deep learning toolkit for image, video, and audio synthesis ☆107 Updated 2 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart) ☆25 Updated 3 years ago
- Windows compatible code for the paper "Jukebox: A Generative Model for Music" ☆13 Updated 2 years ago
- NeMo: a toolkit for conversational AI ☆11 Updated 2 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 Updated 2 years ago
- NVIDIA's TalkNET - Train on colab ☆37 Updated 2 years ago
- A notebook for text-based guided image generation using StyleGANXL and CLIP. ☆59 Updated 2 years ago
- ☆52 Updated 3 years ago
- Guided diffusion ☆11 Updated 3 years ago
- Start here ☆111 Updated last year
- High-Resolution Image Synthesis with Latent Diffusion Models ☆60 Updated 3 years ago
- FILM: Frame Interpolation for Large Motion, In arXiv 2022. ☆29 Updated 3 years ago
- ☆84 Updated 3 years ago
- text-to-audio-latent-diffusion ☆37 Updated 2 years ago
- jupyter/colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fix ☆38 Updated 3 years ago
- CLIP and PASTE: Using AI to Create Photo Collages from Text Prompts ☆29 Updated 3 years ago
- openai guided diffusion tweaks ☆51 Updated 3 years ago
- Discord AI Generation Bot to collect an aesthetic rating dataset ☆58 Updated 2 years ago
- Tools for smoothly interpolating between prompts for Stable Diffusion models ☆58 Updated 3 years ago
- ☆19 Updated 4 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p… ☆52 Updated 3 years ago
- ☆26 Updated 4 years ago
- AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE ☆119 Updated 3 years ago
- Trainer for audio-diffusion-pytorch ☆129 Updated 2 years ago
- ☆106 Updated last year
- Voice swapping with VQ-VAE and diffusion models ☆67 Updated 3 years ago
- Traditional deepdream with VQGAN+CLIP and optical flow. Ready to use in Google Colab. ☆23 Updated 3 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆120 Updated 3 months ago