multimodal-art-projection / AutoMV
☆82 Updated 3 weeks ago
Alternatives and similar repositories for AutoMV
Users who are interested in AutoMV are comparing it to the repositories listed below.
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis. ☆45 Updated last year
- An official implementation of SwapAnyone. ☆74 Updated 10 months ago
- ☆147 Updated 6 months ago
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation ☆63 Updated 7 months ago
- ☆132 Updated 7 months ago
- Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple … ☆111 Updated last month
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation ☆299 Updated 2 months ago
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran… ☆147 Updated last month
- The homepage of LongCat-Video-Avatar ☆128 Updated last month
- An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community … ☆58 Updated this week
- ☆240 Updated last month
- ☆114 Updated 7 months ago
- Official implementation of the paper "MusicInfuser: Making Video Diffusion Listen and Dance" ☆81 Updated 9 months ago
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou… ☆32 Updated 4 months ago
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation ☆67 Updated last year
- ☆77 Updated 8 months ago
- LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models ☆146 Updated 7 months ago
- MOVA: Towards Scalable and Synchronized Video–Audio Generation ☆292 Updated this week
- Music production for silent film clips. ☆32 Updated 9 months ago
- DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework ☆142 Updated 5 months ago
- ☆34 Updated 3 months ago
- Official Repo for MoCha Towards Movie-Grade Talking Character Synthesis ☆61 Updated last month
- ☆18 Updated 7 months ago
- ☆72 Updated 2 months ago
- PodAgent: A Comprehensive Framework for Podcast Generation ☆123 Updated 8 months ago
- ☆62 Updated 7 months ago
- ☆146 Updated last month
- OpenVideo specializes in the domain of text-to-video generation, with the goal of providing high-quality and diverse video datasets to AI… ☆113 Updated 8 months ago
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation ☆111 Updated last month
- JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment ☆150 Updated 5 months ago