camenduru / audioldm-colab
AudioLDM text to audio colab
☆19Updated last year
Alternatives and similar repositories for audioldm-colab:
Users that are interested in audioldm-colab are comparing it to the libraries listed below
- text-to-audio-latent-diffusion☆37Updated last year
- ☆27Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 7 months ago
- ☆39Updated 11 months ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆41Updated 3 weeks ago
- Text prompt steered synthetic audio generators☆46Updated last year
- Prepare spectrograms from audio for training a Riffusion model☆15Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated 11 months ago
- ☆22Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 5 months ago
- ☆27Updated last year
- fine-tuning MusicGen without prompts to generate music with a specific style☆62Updated last year
- Real-time end-to-end singing voice convertion☆21Updated 5 months ago
- ☆14Updated last year
- Finally, some decent sample sentences☆22Updated last year
- ☆28Updated last year
- AudioSR-Upsampling (any -> 48kHz)☆40Updated last year
- Music production for silent film clips.☆21Updated last month
- BEGANSing - Korean SVS + SVC + AudioSR☆11Updated last year
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Updated last year
- Text-to-Music Generation with Rectified Flow Transformer☆8Updated 7 months ago
- Codebase and project page for EDMSound☆34Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆14Updated 6 months ago
- Huggingface Backup - Jupyter, Colab and Python Script☆10Updated 3 weeks ago
- Fork of AudioLDM as a TuneFlow plugin☆39Updated 2 years ago
- ☆16Updated last year
- The demo page of UniAudio☆33Updated last year
- Create training data for training a voice cloner for bark text to speech.☆44Updated last year
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆28Updated last year
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆42Updated 3 weeks ago