youmebangbang / deepvoice-model-utilities
☆20Updated 2 years ago
Alternatives and similar repositories for deepvoice-model-utilities
Users who are interested in deepvoice-model-utilities are comparing it to the libraries listed below.
- GradioUI for TortoiseTTS voice generation☆34Updated 2 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Updated 3 years ago
- A web app that lets you play around with TalkNet models☆124Updated 2 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 3 years ago
- NeMo: a toolkit for conversational AI☆11Updated 3 years ago
- ☆107Updated 2 years ago
- Tools to isolate speaker and transcribe unstructured audio clips☆11Updated 3 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated last year
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 3 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆123Updated 6 months ago
- tools to manipulate audio with riffusion☆95Updated 2 years ago
- openai guided diffusion tweaks☆52Updated 3 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆32Updated 2 years ago
- A latent text-to-image diffusion model☆67Updated 3 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆195Updated 2 years ago
- ☆52Updated 3 years ago
- Community framework for training tortoise☆44Updated 3 years ago
- A collection of pre-trained audio models, in PyTorch.☆114Updated 2 years ago
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated 2 years ago
- Tools to train a generative model on arbitrary audio samples☆65Updated 3 years ago
- Deep learning toolkit for image, video, and audio synthesis☆108Updated 3 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆89Updated 4 years ago
- Audio datasets, easier.☆86Updated 2 years ago
- ☆83Updated 3 years ago
- Traditional deepdream with VQGAN+CLIP and optical flow. Ready to use in Google Colab.☆23Updated 3 years ago
- Windows compatible code for the paper "Jukebox: A Generative Model for Music"☆13Updated 3 years ago
- Trainer for audio-diffusion-pytorch☆130Updated 2 years ago
- Resonance: Audio-Image Interconversion for AI Diffusion Models☆39Updated last year
- Faster Tortoise inference than Tortoise Fast Fork☆128Updated last year