UCSC-VLAA/Complex-Edit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UCSC-VLAA/Complex-Edit)

UCSC-VLAA / Complex-Edit

Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark

☆28

Alternatives and similar repositories for Complex-Edit

Users that are interested in Complex-Edit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amazon-science / PIXELS
View on GitHub
Official implementation for the AAAI2025 paper "PIXELS - Progressive Image Xemplar-based Editing with Latent Surgery"
☆11Dec 17, 2024Updated last year
Eureka-Maggie / MIGE
View on GitHub
Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing
☆73Jul 13, 2025Updated 10 months ago
OPPO-Mente-Lab / X2Edit
View on GitHub
AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning
☆97Nov 21, 2025Updated 6 months ago
Tianhao-Qi / Mask2DiT
View on GitHub
CVPR 2025 Accepted Papers
☆25Dec 20, 2025Updated 5 months ago
researchmm / AI_Illustrator
View on GitHub
[MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
☆11Apr 3, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zhangzef / COOPER
View on GitHub
The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.
☆36Dec 30, 2025Updated 4 months ago
UCSC-VLAA / FedConv
View on GitHub
[TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…
☆25Apr 30, 2024Updated 2 years ago
Tencent / HaploVLM
View on GitHub
ICML2025
☆64Aug 28, 2025Updated 8 months ago
sail-sg / Video-Next-Event-Prediction
View on GitHub
☆25Aug 9, 2025Updated 9 months ago
Vision-CAIR / Infinibench
View on GitHub
Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows
☆20Nov 4, 2025Updated 6 months ago
AFeng-x / PixWizard
View on GitHub
[ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from u…
☆210May 5, 2025Updated last year
UCSC-VLAA / CLIPS
View on GitHub
An Enhanced CLIP Framework for Learning with Synthetic Captions
☆40Apr 18, 2025Updated last year
jylei16 / Imagine-e
View on GitHub
☆14Jan 22, 2025Updated last year
aim-uofa / FreeCustom
View on GitHub
[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
☆177Sep 1, 2025Updated 8 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
KAIST-Visual-AI-Group / Flow-Inference-Time-Scaling
View on GitHub
[NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
☆74Oct 12, 2025Updated 7 months ago
fudan-zvg / TDAS
View on GitHub
☆18Jun 10, 2022Updated 3 years ago
TFNTF / PostEdit
View on GitHub
Codes of PostEdit
☆23Apr 28, 2025Updated last year
WeihuangLin / INF-LLaVA
View on GitHub
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model
☆42Aug 4, 2024Updated last year
UCSC-VLAA / CRATE-alpha
View on GitHub
This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"
☆46Jun 3, 2024Updated last year
antonioo-c / Diptych-Prompting
View on GitHub
Unofficial implementation of 'Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator'
☆10Dec 10, 2024Updated last year
wangf3014 / Patch_Scaling
View on GitHub
Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More
☆25Feb 25, 2025Updated last year
limbo0000 / mtm
View on GitHub
Official implementation of MTM
☆21Aug 30, 2023Updated 2 years ago
wtybest / EnMMDiT
View on GitHub
[TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
☆14Mar 7, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
knightyxp / VideoCoF
View on GitHub
[CVPR 2026 Highlight] VideoCoF: Unified Video Editing with Temporal Reasoner
☆184Feb 22, 2026Updated 2 months ago
Lucky-Lance / SPP
View on GitHub
[ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models
☆22May 28, 2024Updated last year
guozinan126 / MUSAR
View on GitHub
☆30May 7, 2025Updated last year
DCDmllm / AnyEdit
View on GitHub
【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"
☆220Apr 5, 2025Updated last year
jianzongwu / MotionBooth
View on GitHub
[NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"
☆138Oct 8, 2024Updated last year
Sainzerjj / FitDiT
View on GitHub
The training implementation of the paper "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on."
☆12Apr 8, 2025Updated last year
TIGER-AI-Lab / OmniEdit
View on GitHub
Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]
☆145Jan 27, 2025Updated last year
lzyhha / VisualCloze
View on GitHub
[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…
☆282Jan 7, 2026Updated 4 months ago
KaKituken / affordance-aware-any
View on GitHub
Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion
☆48Feb 21, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
PeterGriffinJin / InstructG2I
View on GitHub
InstructG2I: Synthesizing Images from Multimodal Attributed Graphs (NeurIPs 2024)
☆20Oct 17, 2024Updated last year
UCSC-VLAA / HQ-Edit
View on GitHub
[ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
☆113Apr 18, 2024Updated 2 years ago
JIA-Lab-research / DreamOmni3
View on GitHub
This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''
☆39Dec 30, 2025Updated 4 months ago
yuhuixu1993 / BNET
View on GitHub
Batch Normalization with Enhanced Linear Transformation
☆53Mar 13, 2024Updated 2 years ago
junjiehe96 / UniPortrait
View on GitHub
[ICCV2025] UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization
☆277May 1, 2025Updated last year
snap-research / AVLink
View on GitHub
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
☆17Aug 3, 2025Updated 9 months ago
ai-anchorite / Diffusers-Image-Community
View on GitHub
Diffusers Image Fill v3 -- Inpaint or Remove objects from an image - or Outpaint - or Outpaint Video Zoom: 16GB+ GPU | 32GB+ RAM | 20GB+…
☆17Nov 9, 2024Updated last year