PeterGriffinJin / InstructG2I
InstructG2I: Synthesizing Images from Multimodal Attributed Graphs (NeurIPS 2024)
☆20Updated last year
Alternatives and similar repositories for InstructG2I
Users who are interested in InstructG2I are comparing it to the repositories listed below:
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆37Updated 11 months ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Updated 9 months ago
- [ACM Multimedia 2025 Datasets Track] EditWorld: Simulating World Dynamics for Instruction-Following Image Editing☆138Updated 5 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆51Updated last year
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆73Updated 3 months ago
- Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation☆42Updated 5 months ago
- The official repo for "D-AR: Diffusion via Autoregressive Models"☆131Updated 7 months ago
- [ICLR 2026] Generative Universal Verifier as Multimodal Meta-Reasoner☆44Updated 2 months ago
- [ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati…☆33Updated 6 months ago
- ☆47Updated 9 months ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Updated 9 months ago
- Official source code of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)☆61Updated last year
- [NeurIPS 2025] Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".☆94Updated 2 months ago
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation☆72Updated last year
- [ICLR 2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆39Updated 11 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆55Updated 5 months ago
- [CVPR 2025] Multi-focal Conditioned Latent Diffusion for Person Image Synthesis☆21Updated 10 months ago
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models☆66Updated last year
- [ICLR 2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflectio…☆99Updated 11 months ago
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆57Updated 3 months ago
- [NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆50Updated 5 months ago
- ☆37Updated 8 months ago
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆71Updated 6 months ago
- [NeurIPS 2025] Official Implementation (PyTorch) of "DeepVideo-R1"☆31Updated 2 months ago
- ☆64Updated 2 years ago
- Video Diffusion Transformers are In-Context Learners☆36Updated last year
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆119Updated last year
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆111Updated 4 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69Updated 8 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Updated 5 months ago