HUIZ-A / SVA
☆19Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for SVA
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆39Updated last month
- ☆46Updated 3 months ago
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆75Updated 11 months ago
- ☆30Updated 3 weeks ago
- ☆2Updated last month
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆54Updated 3 weeks ago
- Animatediff implementation. Includes a ControlNet pipeline.☆19Updated 10 months ago
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"☆34Updated 2 months ago
- A example pipeline to use InstructPix2Pix and the associated fine-tuned motion module☆31Updated last year
- ☆75Updated 10 months ago
- Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models☆160Updated 5 months ago
- A retrain of AnimateDiff to be conditional on an init image☆33Updated last year
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated last month
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models☆108Updated 4 months ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆11Updated 2 months ago
- [ECCV 2024] IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation☆50Updated last month
- ☆64Updated 5 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆61Updated 7 months ago
- Preprocessing Scipts for Talking Face Generation☆70Updated 3 months ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆21Updated 3 months ago
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.☆84Updated 3 weeks ago
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation☆50Updated last month
- ☆18Updated 7 months ago
- ☆40Updated 3 months ago
- Anim-400K: A dataset designed from the ground up for automated dubbing of video☆98Updated 4 months ago
- An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community …☆54Updated this week