ivcylc / OpenMusic
OpenMusic: SOTA Text-to-music (TTM) Generation
☆602 · Updated last month
Alternatives and similar repositories for OpenMusic
Users who are interested in OpenMusic are comparing it to the repositories listed below.
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching ☆766 · Updated 2 weeks ago
- Repository of AudioX ☆1,061 · Updated 3 months ago
- PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reas… ☆939 · Updated 3 weeks ago
- ☆672 · Updated this week
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation ☆253 · Updated last month
- Memory-Guided Diffusion for Expressive Talking Video Generation ☆1,051 · Updated last week
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆307 · Updated last month
- The official HelloMeme GitHub site ☆621 · Updated last month
- Awesome music generation model: MG² ☆160 · Updated 4 months ago
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝 ☆615 · Updated last year
- ☆1,528 · Updated last week
- InspireMusic: A toolkit designed for music, song, and audio generation ☆1,167 · Updated 2 months ago
- PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model ☆650 · Updated last year
- The official Soundwave repository ☆217 · Updated 4 months ago
- ☆512 · Updated last month
- ☆270 · Updated last month
- [CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer ☆1,296 · Updated 5 months ago
- ☆450 · Updated 2 months ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te… ☆1,090 · Updated 6 months ago
- ☆710 · Updated this week
- ☆266 · Updated last year
- YuE: Open Full-song Generation Foundation for the GPU Poor ☆428 · Updated 5 months ago
- SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline ☆252 · Updated 2 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆598 · Updated 4 months ago
- Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation". ☆570 · Updated 3 weeks ago
- This is the official repository for M2UGen ☆497 · Updated 7 months ago
- NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms ☆1,070 · Updated 3 months ago
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025] ☆179 · Updated 3 months ago
- [CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition ☆745 · Updated 2 months ago
- Official comfyui repository of Hellomeme ☆368 · Updated last month