youmebangbang / deepvoice-model-utilities
☆21Updated last year
Alternatives and similar repositories for deepvoice-model-utilities:
Users that are interested in deepvoice-model-utilities are comparing it to the libraries listed below
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- Tools to isolate speaker and transcribe unstructured audio clips☆11Updated 2 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 2 years ago
- Windows compatible code for the paper "Jukebox: A Generative Model for Music"☆13Updated 2 years ago
- ☆27Updated last year
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆53Updated 2 years ago
- text-to-audio-latent-diffusion☆37Updated last year
- Generate images from texts. In Russian☆19Updated 3 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Updated 2 years ago
- openai guided diffusion tweaks☆52Updated 2 years ago
- Repo for structured dreaming☆55Updated 2 years ago
- AudioLDM text to audio colab☆19Updated last year
- ☆13Updated 2 years ago
- NeMo: a toolkit for conversational AI☆10Updated 2 years ago
- Examples of apps built with Nendo, the AI Audio Tool Suite☆55Updated last year
- ☆107Updated last year
- ☆11Updated last year
- Doohickey is a stable diffusion tool for technical artists who want to stay up-to-date with the latest developments in the field.☆39Updated 2 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 4 months ago
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP☆39Updated 2 years ago
- Frontend (and soon also midleware and backend) for a new, opensource image generation platform.☆14Updated 2 years ago
- Blender Keyframe Exporter for AI Animation☆13Updated 2 years ago
- A quick test using a Stable Diffusion server and Godot 4☆11Updated last year
- tools to manipulate audio with riffusion☆91Updated last year
- Deep learning toolkit for image, video, and audio synthesis☆108Updated 2 years ago
- A neural network based file sorter. Trains an autoencoder to sort images or audio based on the similarity of their encodings, or uses the…☆29Updated last year
- ☆20Updated 3 years ago
- Trainer for audio-diffusion-pytorch☆128Updated 2 years ago
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.☆10Updated last year
- Simple local all-in-one install for IDEA2.ART☆26Updated 2 years ago