PICABench: How Far Are We from Physically Realistic Image Editing?
☆36Nov 5, 2025Updated 3 months ago
Alternatives and similar repositories for PICABench
Users that are interested in PICABench are comparing it to the libraries listed below
Sorting:
- UnicEdit-10M and UnicBench project☆23Feb 8, 2026Updated 3 weeks ago
- Decoupled Textual Embeddings for Customized Image Generation (AAAI 2024)☆30Feb 29, 2024Updated 2 years ago
- ☆16Mar 24, 2025Updated 11 months ago
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer"☆89Apr 1, 2025Updated 11 months ago
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing☆23Aug 23, 2025Updated 6 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆46Nov 25, 2025Updated 3 months ago
- AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆94Nov 21, 2025Updated 3 months ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆23Jul 30, 2025Updated 7 months ago
- [CVPR 2026] V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties☆131Jan 17, 2026Updated last month
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆58Dec 26, 2025Updated 2 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- [ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati…☆34Jan 26, 2026Updated last month
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- [ACL 2025] The official pytorch implement of "MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection".☆26May 26, 2025Updated 9 months ago
- Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning☆232Feb 10, 2026Updated 2 weeks ago
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated 2 months ago
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆73Feb 13, 2026Updated 2 weeks ago
- ReNeg: Learning Negative Embedding with Reward Guidance☆35Dec 22, 2025Updated 2 months ago
- ASTRA is an end-to-end system for synthesizing agentic trajectories and rule-verifiable environments for SFT and RL training, developed b…☆114Jan 30, 2026Updated last month
- Software to enable data-rich collaboration from high-resolution display walls to your laptop☆16Feb 19, 2026Updated last week
- A collection of awesome think with videos papers.☆90Dec 1, 2025Updated 3 months ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- [ICCV 2025] Official Implementation of Steering Rectified Flow Models in the Vector Field for Controlled Image Generation☆44Jun 27, 2025Updated 8 months ago
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory.☆29Updated this week
- InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥☆36May 11, 2024Updated last year
- [NeurIPS 2025] The official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tun…☆40Feb 20, 2025Updated last year
- [AAAI2026] Implementation Code for Omni-Effects☆173Dec 9, 2025Updated 2 months ago
- Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation☆110Updated this week
- ☆95Feb 4, 2026Updated 3 weeks ago
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO☆92Dec 1, 2025Updated 3 months ago
- ☆24Dec 19, 2025Updated 2 months ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations.☆26Updated this week
- MCP server for Grok AI API integration☆21Jun 2, 2025Updated 8 months ago
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- ComfyUI custom node implementation of VideoMaMa for video matting with mask conditioning.☆34Feb 9, 2026Updated 3 weeks ago
- Codes for Difflare: Removing Image Flare with Latent Diffusion Models☆11Dec 24, 2024Updated last year
- ☆16Sep 18, 2025Updated 5 months ago
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆24Jan 21, 2026Updated last month
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year