ivcylc / OpenMusic
OpenMusic: SOTA Text-to-music (TTM) Generation
☆531Updated last week
Alternatives and similar repositories for OpenMusic:
Users that are interested in OpenMusic are comparing it to the libraries listed below
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆663Updated last week
- The official HelloMeme GitHub site☆573Updated last week
- Awesome music generation model——MG²☆141Updated 3 weeks ago
- InspireMusic: A Unified Framework for Music, Song, Audio Generation.☆903Updated this week
- Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks☆1,092Updated this week
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音 效 😝☆527Updated 7 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆259Updated 3 months ago
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code☆583Updated 4 months ago
- [CVPR 2025🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition☆609Updated this week
- PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model☆632Updated 9 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation☆740Updated last month
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,649Updated last month
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,055Updated 3 weeks ago
- This is the official repository for M2UGen☆475Updated 2 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆380Updated 2 weeks ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆882Updated 4 months ago
- Interface for OuteTTS models.☆940Updated 2 weeks ago
- Official comfyui repository of Hellomeme☆332Updated last week
- Diffusion-based Portrait and Animal Animation☆682Updated last month
- Mustango: Toward Controllable Text-to-Music Generation☆354Updated 7 months ago
- talking-face video editing☆257Updated this week
- [CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos…☆1,172Updated this week
- Generative models for conditional audio generation☆140Updated 3 weeks ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆306Updated 10 months ago
- Fine-tune Stable Audio Open with DiT ControlNet.☆204Updated 2 weeks ago