youmebangbang / deepvoice-model-utilitiesLinks
☆21Updated 2 years ago
Alternatives and similar repositories for deepvoice-model-utilities
Users that are interested in deepvoice-model-utilities are comparing it to the libraries listed below
Sorting:
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆53Updated 3 years ago
- NeMo: a toolkit for conversational AI☆10Updated 2 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 2 years ago
- Repo for structured dreaming☆55Updated 3 years ago
- Windows compatible code for the paper "Jukebox: A Generative Model for Music"☆13Updated 2 years ago
- [DEPRECIATED] Symbolic MIDI Music AI implementation☆19Updated 2 years ago
- ☆66Updated last year
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Text prompt steered synthetic audio generators☆47Updated last month
- Tools to isolate speaker and transcribe unstructured audio clips☆11Updated 2 years ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model"☆15Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 4 years ago
- Finally, some decent sample sentences☆23Updated last year
- openai guided diffusion tweaks☆51Updated 2 years ago
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- ☆11Updated last year
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.☆10Updated 2 years ago
- Discord AI Generation Bot to collect an aesthetic rating dataset☆60Updated 2 years ago
- Blender Keyframe Exporter for AI Animation☆12Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch.☆112Updated 2 years ago
- AudioLDM text to audio colab☆18Updated last year
- Image restoration with neural networks but without learning.☆46Updated 3 years ago
- Demo for 2022 Interspeech☆29Updated 2 years ago
- ☆83Updated 2 years ago
- ☆27Updated last year
- ☆19Updated 3 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆15Updated last year
- Deep learning toolkit for image, video, and audio synthesis☆108Updated 2 years ago
- ☆107Updated last year