camenduru / FluxMusicLinks

Text-to-Music Generation with Rectified Flow Transformer

☆8

Alternatives and similar repositories for FluxMusic

Users that are interested in FluxMusic are comparing it to the libraries listed below

Sorting:

curtified / FluxMusicGUI
Text-to-Music Generation with Rectified Flow Transformer
☆64Updated last month
camenduru / FluxMusic-jupyter
☆19Updated 10 months ago
justinjohn0306 / SpeedScribe
High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…
☆10Updated 8 months ago
rsxdalv / musicgen-prompts
Site for sharing MusicGen + AudioGen Prompts and Creations
☆45Updated 3 months ago
Haurrus / xtts-trainer-no-ui-auto
This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …
☆13Updated 9 months ago
fakerybakery / OpenF5-TTS
(WIP) A retrain of F5-TTS on permissively-licensed data
☆11Updated 3 months ago
lucasnewman / e2-tts-mlx
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX
☆20Updated 9 months ago
camenduru / audioldm-colab
AudioLDM text to audio colab
☆19Updated last year
dioneapp / dioneapp
Explore, Install, Innovate — in 1 Click.
☆27Updated this week
camenduru / styletts-colab
☆39Updated last year
jmoso13 / jukebox-diffusion
☆107Updated last year
serp-ai / ai-text-to-audio-latent-diffusion
text-to-audio-latent-diffusion
☆37Updated last year
camenduru / mimic-motion-tost
☆23Updated 8 months ago
d3n7 / riffusionPrepper
Prepare spectrograms from audio for training a Riffusion model
☆15Updated 2 years ago
taresh18 / TTSizer
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
☆90Updated last month
Bill13579 / beltout
BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC
☆63Updated this week
Stability-AI / stable-audio-demo
☆11Updated last year
neonbjb / pyfastmp3decoder
A fast MP3 decoder for python, using minimp3
☆29Updated 2 years ago
camenduru / singing-voice-conversion-colab
☆27Updated last year
Djmcflush / RaveFussion
A text to audio pipeline using Riffusion (a finetuned stablediffusion model) and using RAVE a audio to audio AutoEncoder.
☆16Updated 2 weeks ago
JarodMica / GPT-SoVITS-Package
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆26Updated last month
tuneflow / AudioLDM
Fork of AudioLDM as a TuneFlow plugin
☆42Updated 2 years ago
jakariaemon / WSI
Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.
☆20Updated 4 months ago
asigalov61 / Giant-Music-Transformer
[SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel…
☆87Updated 6 months ago
joeljuvel / YuE-UI
Gradio UI for YuE
☆65Updated 3 months ago
camenduru / InstantID-IPAdapter-ControlNet-jupyter
☆24Updated last year
JarodMica / tortoise_dataset_tools
Misc. tools/scripts that I made to use for tortoise
☆21Updated 10 months ago
korakoe / VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
☆16Updated last year
camenduru / PIA-colab
☆24Updated last year
sammcj / ollama-artefacts
Build HTML artefacts with Ollama
☆11Updated 7 months ago