iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
☆186Dec 1, 2025Updated 3 months ago
Alternatives and similar repositories for iMontage
Users that are interested in iMontage are comparing it to the libraries listed below
Sorting:
- [ICLR 2026] The official implementation of "RegionE: Adaptive Region-Aware Generation for Efficient Image Editing"☆82Feb 3, 2026Updated last month
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆235Aug 22, 2025Updated 6 months ago
- [CVPR 2026] ViStoryBench: AI Story Visualization Benchmark☆137Updated this week
- https://github.com/xie-lab-ml/Golden-Noise-for-Diffusion-Models for ComfyUI☆17Dec 10, 2024Updated last year
- ☆26Jun 22, 2024Updated last year
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆727Nov 27, 2025Updated 3 months ago
- The official UniVerse-1 code.☆122Oct 13, 2025Updated 4 months ago
- ☆24Jan 26, 2026Updated last month
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆44Aug 9, 2025Updated 7 months ago
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆217Aug 11, 2025Updated 6 months ago
- [NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…☆108Feb 10, 2026Updated 3 weeks ago
- Transition Models☆145Oct 7, 2025Updated 5 months ago
- ComfyUI nodes for CAD loading, manipulation, meshing. Using GMSH and OCC.☆22Mar 1, 2026Updated last week
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆23Feb 11, 2026Updated 3 weeks ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆154Updated this week
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆111Sep 19, 2025Updated 5 months ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆62Dec 16, 2025Updated 2 months ago
- A Unified Visual Generator with Interleaved OmniModal Context☆200Updated this week
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆21Jul 3, 2025Updated 8 months ago
- ComfyUI version of WithAnyone☆24Dec 18, 2025Updated 2 months ago
- [ICLR 2026] ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation☆70Feb 12, 2026Updated 3 weeks ago
- ☆83Jan 25, 2026Updated last month
- ☆36Updated this week
- ☆131Dec 24, 2025Updated 2 months ago
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 7 months ago
- [NeurIPS'24] NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction☆125Sep 26, 2024Updated last year
- ☆16Nov 28, 2023Updated 2 years ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆40Jan 29, 2026Updated last month
- Unofficial implementation of MIMO (MImicking anyone anywhere with complex Motions and Object interactions)☆10Nov 22, 2024Updated last year
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆72Oct 12, 2025Updated 4 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆420Aug 26, 2025Updated 6 months ago
- [CVPR2026]We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length vide…☆457Feb 21, 2026Updated 2 weeks ago
- ☆89Jan 4, 2026Updated 2 months ago
- ☆110Sep 3, 2025Updated 6 months ago
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Nov 21, 2024Updated last year
- ☆37Sep 5, 2024Updated last year