apple / ml-sid-dit
☆34 Updated 3 months ago
Alternatives and similar repositories for ml-sid-dit
Users who are interested in ml-sid-dit are comparing it to the libraries listed below.
- An official implementation of SwapAnyone. ☆74 Updated 10 months ago
- ☆46 Updated 2 months ago
- VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning ☆60 Updated 3 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis" ☆20 Updated last year
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing ☆58 Updated last month
- Make self forcing endless. Add cache purging. Add prompt controllability. ☆69 Updated 4 months ago
- [ICCV'25] FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model ☆82 Updated 6 months ago
- Official implementation of Progressive Detail Injection for Training-Free Semantic Binding in Text-to-Image Generation ☆31 Updated 6 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (ICLR 2026) ☆40 Updated 6 months ago
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms ☆45 Updated 2 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video ☆48 Updated 4 months ago
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling" ☆43 Updated last year
- Generate image at any resolution. ☆43 Updated 4 months ago
- ☆46 Updated last month
- ☆132 Updated 7 months ago
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices ☆131 Updated 2 months ago
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025) ☆52 Updated 3 weeks ago
- LIA-X: Interpretable Latent Portrait Animator ☆97 Updated 4 months ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024] ☆22 Updated last year
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder ☆57 Updated last year
- This is the official implementation of "T-LoRA: Single Image Diffusion Model Customization Without Overfitting" ☆127 Updated 2 weeks ago
- [ICCV 2025, Highlight] Official Pytorch implementation of the paper: "ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mi… ☆36 Updated 6 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models. ☆50 Updated 11 months ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks. ☆15 Updated last year
- Blending Custom Photos with Video Diffusion Transformers ☆48 Updated last year
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i… ☆23 Updated 2 months ago
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging ☆46 Updated 9 months ago
- ☆35 Updated 8 months ago
- ☆107 Updated 5 months ago
- ☆19 Updated last year