Official implementation of the paper "MusicInfuser: Making Video Diffusion Listen and Dance"
☆82Apr 10, 2025Updated 10 months ago
Alternatives and similar repositories for MusicInfuser
Users that are interested in MusicInfuser are comparing it to the libraries listed below
Sorting:
- Official implementation of "ControlFace: Harnessing Facial Parametric Control for Face Rigging".☆42Mar 5, 2025Updated 11 months ago
- Official implementation of "Visual Persona: Foundation Model for Full-Body Human Customization" (CVPR 2025)☆45Feb 20, 2026Updated last week
- Official Implementation of "Multi-Granularity Video Object Segmentation" (AAAI 2025)☆25Dec 20, 2024Updated last year
- A precise and stable CFG for negative prompts, derived via guided sampling with contrastive loss.☆14Dec 27, 2024Updated last year
- Async MCP server with Minimax API integration for image generation and text-to-speech☆51Jan 29, 2026Updated last month
- Official Implementation of "Towards Open-Vocabulary Semantic Segmentation without Semantic Labels" (NeurIPS 2024)☆53Oct 7, 2024Updated last year
- Official Implementation of "Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry"☆31Nov 10, 2025Updated 3 months ago
- Threestudio extension of the paper "Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation".☆48Mar 12, 2024Updated last year
- Official code implementation of MIDMs: Matching Interleaved Diffusion Models for Exemplar-based Image Translation (AAAI'23)☆71Jan 8, 2023Updated 3 years ago
- Official implementation of "InterRVOS: Interaction-aware Referring Video Object Segmentation".☆26Dec 31, 2025Updated 2 months ago
- Official Implementation of "MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation" (AAAI 2025)☆175Jan 14, 2025Updated last year
- Official implementation of "Referring Video Object Segmentation via Language Aligned Track Selection".☆40Jun 2, 2025Updated 9 months ago
- [ICCV 2025] DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness☆174Feb 11, 2026Updated 2 weeks ago
- Official implementation of "AnthroTAP: Learning Point Tracking with Real-World Motion"☆25Feb 22, 2026Updated last week
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95May 22, 2025Updated 9 months ago
- Universal-Noise Annotation☆24Dec 23, 2023Updated 2 years ago
- Official Implementation of "Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation"☆46Jan 29, 2026Updated last month
- Official implementation of the paper "Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention" (Neu…☆136Oct 3, 2024Updated last year
- Official implementation of "AM-Adapter: Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis in-the-Wild" (ICCV 2025)☆27Jul 8, 2025Updated 7 months ago
- Official implementation of "Seurat: From Moving Points to Depth", CVPR 2025 Highlight☆68Apr 9, 2025Updated 10 months ago
- [Wild3D @ ICCVW'25] Official implementation of "SE-NeRF : Self-Evolving Neural Radiance Fields"☆43Sep 15, 2025Updated 5 months ago
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆45Sep 11, 2024Updated last year
- Music-Aligned Holistic 3D Dance Generation via Hierarchical Motion Modeling [ICCV 2025] Official PyTorch implementation☆33Nov 11, 2025Updated 3 months ago
- Official implementation of "Retrieval-Augmented Score Distillation for Text-to-3D Generation"☆54Dec 13, 2024Updated last year
- Loop your image from output to input in your ComfyUI workflow☆14Jan 16, 2026Updated last month
- ComfyUI implementation of FlashFace: Human Image Personalization with High-fidelity Identity Preservation☆26Jul 31, 2024Updated last year
- ☆27Dec 26, 2023Updated 2 years ago
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆27Sep 12, 2024Updated last year
- [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Tho…☆1,163Jan 27, 2026Updated last month
- Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling☆29Sep 17, 2024Updated last year
- Official repository for "Regularization by Texts for Latent Diffusion Inverse Solvers" (ICLR2025 spotlight)☆17Mar 17, 2025Updated 11 months ago
- [NeurIPS 2024] SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models☆207Jan 24, 2025Updated last year
- Official implementation of "S⁴M: Boosting Semi-Supervised Instance Segmentation with SAM" (ICCV 2025)☆38Jul 15, 2025Updated 7 months ago
- The official repository for DreamSampler (ECCV24)☆37Oct 11, 2024Updated last year
- Differentiable Augmentation for Data-Efficient GAN Training☆11Aug 9, 2020Updated 5 years ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆46Sep 19, 2025Updated 5 months ago
- Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (…☆174Feb 27, 2024Updated 2 years ago
- [CVPR 2024 (Highlight)] Unifying Correspondence, Pose and NeRF for Generalized Pose-Free Novel View Synthesis from Stereo Pairs☆121Apr 5, 2024Updated last year
- Official repository for CATs++: Boosting Cost Aggregation with Convolutions and Transformers (TPAMI'22)☆49Jan 10, 2024Updated 2 years ago