ivcylc / OpenMusic
OpenMusic: SOTA Text-to-music (TTM) Generation
☆559Updated 2 weeks ago
Alternatives and similar repositories for OpenMusic
Users that are interested in OpenMusic are comparing it to the libraries listed below
Sorting:
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆717Updated 2 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆269Updated 3 weeks ago
- ☆223Updated last month
- PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model☆645Updated 11 months ago
- The official HelloMeme GitHub site☆599Updated last month
- ☆786Updated last week
- Awesome music generation model——MG²☆154Updated last month
- [CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer☆1,216Updated 2 months ago
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝☆578Updated 9 months ago
- The official Soundwave repository☆202Updated last month
- Unified automatic quality assessment for speech, music, and sound.☆478Updated last week
- YuE: Open Full-song Generation Foundation for the GPU Poor☆385Updated 3 months ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,074Updated 3 months ago
- Official comfyui repository of Hellomeme☆346Updated last month
- [CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos…☆1,302Updated 2 weeks ago
- Generative models for conditional audio generation☆151Updated 3 months ago
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code☆700Updated 6 months ago
- ☆107Updated 2 months ago
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer☆519Updated 2 weeks ago
- PyTorch implementation of Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.☆467Updated 2 weeks ago
- Diffusion-based Portrait and Animal Animation☆770Updated 2 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation☆813Updated 3 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆406Updated 8 months ago
- Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.☆214Updated 3 weeks ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆321Updated last year
- InspireMusic: A Unified Framework for Music, Song, Audio Generation.☆1,086Updated this week
- [CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition☆683Updated 3 weeks ago
- ☆289Updated last week
- Fine-tune Stable Audio Open with DiT ControlNet.☆220Updated 3 months ago
- ☆304Updated last week