camenduru / FluxMusic
Text-to-Music Generation with Rectified Flow Transformer
☆8Updated 8 months ago
Alternatives and similar repositories for FluxMusic
Users that are interested in FluxMusic are comparing it to the libraries listed below
- Text-to-Music Generation with Rectified Flow Transformer☆62Updated 8 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 6 months ago
- ☆14Updated 2 months ago
- AudioLDM text to audio colab☆19Updated last year
- ☆19Updated 8 months ago
- ☆12Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated last month
- Site for sharing MusicGen + AudioGen Prompts and Creations☆42Updated last month
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io☆16Updated last year
- Real-time end-to-end singing voice conversion☆21Updated 6 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆18Updated 6 months ago
- Build HTML artefacts with Ollama☆11Updated 5 months ago
- ☆39Updated last year
- ☆14Updated 10 months ago
- ☆27Updated last year
- ☆11Updated last year
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated 7 months ago
- Auto-Video maker handling many AI's☆10Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆16Updated last week
- StyleTTS 2 Optimized Training Fork☆28Updated 3 months ago
- Gradio UI for YuE☆47Updated last month
- A text-to-audio pipeline using Riffusion (a fine-tuned Stable Diffusion model) and RAVE, an audio-to-audio autoencoder.☆16Updated last year
- Prepare spectrograms from audio for training a Riffusion model☆15Updated 2 years ago
- Codebase and project page for EDMSound☆34Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and uses CUDA for …☆13Updated 7 months ago
- ☆22Updated 6 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆35Updated 3 weeks ago
- Open TTS models, built for streaming on the edge☆41Updated 2 months ago
- ☆11Updated last year
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆14Updated last year