yejy53 / Nano-banana-150kLinks

Nano-consistent-150k

☆238

Alternatives and similar repositories for Nano-banana-150k

Users that are interested in Nano-banana-150k are comparing it to the libraries listed below

Sorting:

CodeGoat24 / Pref-GRPO
Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
☆182Updated 2 weeks ago
wyhlovecpp / GPT-Image-Edit
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
☆233Updated 2 months ago
PKU-YuanGroup / Edit-R1
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
☆159Updated 2 weeks ago
yuriYanZeXuan / EEdit
(ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing
☆57Updated last month
XianfengWu01 / LightGen
An Efficient Text-to-Image Generation Pretrain Pipeline
☆119Updated 6 months ago
rongyaofang / prism-bench
This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…
☆106Updated 2 months ago
TempleX98 / EasyRef
[ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM
☆69Updated 3 months ago
YujiaHu1109 / IEAP
[NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models
☆106Updated last month
tinnerhrhe / EvoSearch-codes
An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search
☆98Updated last month
bytedance / ContentV
☆129Updated 4 months ago
TencentARC / BlobCtrl
[SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing
☆22Updated 7 months ago
Eureka-Maggie / MIGE
Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing
☆70Updated 4 months ago
AMAP-ML / S2-Guidance
Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models"
☆142Updated last month
YuqingWang1029 / PAR
[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project
☆178Updated 7 months ago
ali-vilab / ChatDiT
☆51Updated 10 months ago
NJU-PCALab / TextCrafter
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
☆81Updated 3 months ago
mayuelala / FollowYourShape
[ArXiv 2025] Follow-Your-Shape: This repo is the official implementation of "Follow-Your-Shape: Shape-Aware Image Editing via Trajectory…
☆51Updated 3 months ago
Correr-Zhou / MagicTailor
[IJCAI 2025 (Oral)] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion …
☆99Updated 6 months ago
aniki-ly / FreeLong
[NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…
☆60Updated 4 months ago
illume-unified-mllm / ILLUME_plus
☆121Updated 2 months ago
xzc-zju / UltraVideo
[[NeurIPS 2025] UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions
☆67Updated 4 months ago
PKU-YuanGroup / ImgEdit
[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark
☆224Updated last week
knightyxp / VideoGrain
[ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video …
☆155Updated 7 months ago
gogoduan / GoT-R1
GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning
☆101Updated 5 months ago
HuiZhang0812 / CreatiLayout
[ICCV 2025] CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
☆117Updated 3 months ago
chenllliang / DreamEngine
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
☆120Updated 8 months ago
TencentARC / MindOmni
☆132Updated 3 weeks ago
alibaba-damo-academy / Lumos
Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.
☆140Updated 3 months ago
AMAP-ML / Omni-Effects
Implementation Code for Omni-Effects
☆151Updated 2 months ago
zhaoshitian / LeX-Art
Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"
☆73Updated 2 months ago