yejy53 / Nano-banana-150kLinks
Nano-consistent-150k
☆238Updated 3 weeks ago
Alternatives and similar repositories for Nano-banana-150k
Users that are interested in Nano-banana-150k are comparing it to the libraries listed below
Sorting:
- Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning☆182Updated 2 weeks ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆233Updated 2 months ago
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback☆159Updated 2 weeks ago
- (ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing☆57Updated last month
- An Efficient Text-to-Image Generation Pretrain Pipeline☆119Updated 6 months ago
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…☆106Updated 2 months ago
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆69Updated 3 months ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆106Updated last month
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆98Updated last month
- ☆129Updated 4 months ago
- [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing☆22Updated 7 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆70Updated 4 months ago
- Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models"☆142Updated last month
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆178Updated 7 months ago
- ☆51Updated 10 months ago
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes☆81Updated 3 months ago
- [ArXiv 2025] Follow-Your-Shape: This repo is the official implementation of "Follow-Your-Shape: Shape-Aware Image Editing via Trajectory…☆51Updated 3 months ago
- [IJCAI 2025 (Oral)] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion …☆99Updated 6 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆60Updated 4 months ago
- ☆121Updated 2 months ago
- [[NeurIPS 2025] UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions☆67Updated 4 months ago
- [NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark☆224Updated last week
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video …☆155Updated 7 months ago
- GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆101Updated 5 months ago
- [ICCV 2025] CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation☆117Updated 3 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆120Updated 8 months ago
- ☆132Updated 3 weeks ago
- Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.☆140Updated 3 months ago
- Implementation Code for Omni-Effects☆151Updated 2 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"☆73Updated 2 months ago