camenduru / FluxMusic
Text-to-Music Generation with Rectified Flow Transformer
☆8Updated 9 months ago
Alternatives and similar repositories for FluxMusic
Users who are interested in FluxMusic are comparing it to the libraries listed below.
- Text-to-Music Generation with Rectified Flow Transformer☆63Updated last week
- A text-to-audio pipeline using Riffusion (a fine-tuned Stable Diffusion model) and RAVE, an audio-to-audio autoencoder.☆16Updated 2 years ago
- ☆19Updated 9 months ago
- ☆22Updated 7 months ago
- ☆14Updated 11 months ago
- ☆39Updated last year
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆38Updated 2 weeks ago
- Real-time end-to-end singing voice conversion☆22Updated 7 months ago
- AudioLDM text to audio colab☆18Updated last year
- Prepare spectrograms from audio for training a Riffusion model☆15Updated 2 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 7 months ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆45Updated 2 months ago
- ☆12Updated last year
- ☆27Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 7 months ago
- Run AuraFlow on Replicate☆14Updated 10 months ago
- Build HTML artefacts with Ollama☆11Updated 5 months ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated 7 months ago
- ☆16Updated last year
- ☆8Updated 9 months ago
- ☆15Updated 2 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆19Updated 7 months ago
- ☆24Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆18Updated 2 weeks ago
- ☆17Updated 4 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and uses CUDA for …☆12Updated 8 months ago
- ☆13Updated last year
- Miscellaneous tools and scripts that I made to use with tortoise☆21Updated 9 months ago
- Codebase and project page for EDMSound☆34Updated last year
- Jupyter notebooks for PuLID face transfer with Flux.1 dev. Able to run on Google Colab Free Tier☆17Updated 5 months ago