PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following
☆31Jan 24, 2025Updated last year
Alternatives and similar repositories for InstructAny2Pix
Users that are interested in InstructAny2Pix are comparing it to the libraries listed below
Sorting:
- ☆20Sep 17, 2024Updated last year
- The code of Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks☆25Apr 10, 2024Updated last year
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis☆104Jan 18, 2024Updated 2 years ago
- [MM 2023] Toward High Quality Facial Representation Learning☆19Oct 30, 2023Updated 2 years ago
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Jun 2, 2024Updated last year
- Official code of SmartEdit [CVPR-2024 Highlight]☆370Jun 21, 2024Updated last year
- Self-supervised Learning to Bring Dual Reversed Rolling Shutter Images Alive (ICCV2023)☆15Jul 6, 2024Updated last year
- DDS: Delta Denoising Score PyTorch implementation☆19Sep 2, 2023Updated 2 years ago
- ☆20Apr 15, 2025Updated 10 months ago
- Finetune Stable Video Diffusion with Lora☆19Feb 3, 2024Updated 2 years ago
- CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing☆83Nov 21, 2024Updated last year
- ☆28Apr 15, 2024Updated last year
- [ICCV 2023] The datasets and code used in our paper "Foreground Object Search by Distilling Composite Image Feature", ICCV2023.☆22Feb 24, 2026Updated last week
- [NeurIPS 2023] Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing☆112May 15, 2024Updated last year
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆33Dec 9, 2025Updated 2 months ago
- ☆26Jul 17, 2025Updated 7 months ago
- ☆30Apr 24, 2025Updated 10 months ago
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆28Aug 19, 2024Updated last year
- Trying to implement https://arxiv.org/abs/2305.08891☆34Jun 10, 2023Updated 2 years ago
- EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing☆30Mar 29, 2024Updated last year
- we propose FlexEdit, an end-to-end image editing method that leverages both free-shape masks and language instructions for Flexible Editi…☆32Aug 22, 2024Updated last year
- forked from DongZhouGu/arxiv-daily☆22Nov 8, 2022Updated 3 years ago
- [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".☆402Feb 20, 2025Updated last year
- Repository related to Cranfield's AAI MSCs GDP☆11Apr 8, 2023Updated 2 years ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆136Dec 21, 2024Updated last year
- [ICCV 2023] Consistent Image Synthesis and Editing☆840Aug 19, 2024Updated last year
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)☆142Apr 16, 2025Updated 10 months ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆37Feb 26, 2025Updated last year
- Official implementation of "Divide & Bind Your Attention for Improved Generative Semantic Nursing" (BMVC 2023 Oral)☆37Jan 25, 2024Updated 2 years ago
- 🕹 Pikachu-volleyball game-based multi-agent RL environment using PettingZoo☆11Sep 29, 2024Updated last year
- 李宏毅机器学习课程笔记☆10Jul 3, 2022Updated 3 years ago
- Web interface for building and managing your own agentic record label.☆10Updated this week
- 这是一次学校大作业,希望和大家分享,一起进步。此项目分驱动部分,遥控部分,视觉部分以及Web控制部分。是基于ESP32与Jetson Nano做的一个小项目。其中运用到了蓝牙串口片与片之间的通信,IP私域下的多机通信,以及ESP32中便携的Web功能进行通信。具体各部分内容…☆12Nov 5, 2024Updated last year
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆38Sep 10, 2024Updated last year
- This is a list used to collect the available (open-source / closed-source) projects that comply with Google Agent2Agent.☆13Apr 24, 2025Updated 10 months ago
- ☆10Feb 10, 2026Updated 3 weeks ago
- 稚晖君电子Esp32脱机版☆11Jan 15, 2025Updated last year
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.☆442May 14, 2024Updated last year
- ☆35Nov 5, 2024Updated last year