Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
☆28Apr 22, 2025Updated 10 months ago
Alternatives and similar repositories for Complex-Edit
Users that are interested in Complex-Edit are comparing it to the libraries listed below
Sorting:
- Official implementation for the AAAI2025 paper "PIXELS - Progressive Image Xemplar-based Editing with Latent Surgery"☆11Dec 17, 2024Updated last year
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆72Jul 13, 2025Updated 7 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- [MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation☆11Apr 3, 2023Updated 2 years ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 3 months ago
- ☆11Nov 30, 2025Updated 3 months ago
- CVPR 2025 Accepted Papers☆23Dec 20, 2025Updated 2 months ago
- AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆94Nov 21, 2025Updated 3 months ago
- ☆13Jan 22, 2025Updated last year
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆16Aug 3, 2025Updated 6 months ago
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆72Oct 12, 2025Updated 4 months ago
- ☆18Jun 10, 2022Updated 3 years ago
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from u…☆210May 5, 2025Updated 9 months ago
- A Comprehensive Dataset for Advanced Image Generation and Editing}☆31Oct 2, 2025Updated 4 months ago
- Codes of PostEdit☆23Apr 28, 2025Updated 10 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- ☆44Jun 7, 2024Updated last year
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''☆38Dec 30, 2025Updated 2 months ago
- InstructG2I: Synthesizing Images from Multimodal Attributed Graphs (NeurIPs 2024)☆20Oct 17, 2024Updated last year
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Feb 22, 2026Updated last week
- Official implementation of MTM☆21Aug 30, 2023Updated 2 years ago
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆21May 28, 2024Updated last year
- ICML2025☆63Aug 28, 2025Updated 6 months ago
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆37Nov 26, 2025Updated 3 months ago
- This respository contains the code for the CVPR 2023 paper SINE: SINgle Image Editing with Text-to-Image Diffusion Models.☆190Jan 11, 2024Updated 2 years ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]☆141Jan 27, 2025Updated last year
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆138Oct 8, 2024Updated last year
- Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion☆47Feb 21, 2025Updated last year
- Official PyTorch/Diffusers implementation of "RectifiedHR: Enable Efficient High Resolution Image Generation via Energy Rectification"☆30Oct 11, 2025Updated 4 months ago
- [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…☆25Apr 30, 2024Updated last year
- This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"☆47Jun 3, 2024Updated last year
- ☆41May 27, 2025Updated 9 months ago
- Modality Gap–Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models☆51Updated this week
- ☆29May 7, 2025Updated 9 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆39Jan 5, 2026Updated last month
- Long-range camera-conditioned scene generation from one single image.☆105Dec 23, 2025Updated 2 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆57Jan 24, 2025Updated last year
- DCPO: Dynamic Adaptive Clipping for RL☆45Dec 20, 2025Updated 2 months ago