PICABench: How Far Are We from Physically Realistic Image Editing?
☆36Nov 5, 2025Updated 4 months ago
Alternatives and similar repositories for PICABench
Users who are interested in PICABench are comparing it to the repositories listed below
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations☆22Dec 24, 2025Updated 2 months ago
- [ICCV 2025] LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal☆26Oct 20, 2025Updated 5 months ago
- [AAAI 2024] Decoupled Textual Embeddings for Customized Image Generation☆30Feb 29, 2024Updated 2 years ago
- [CVPR 2026] UnicEdit-10M and UnicBench project☆25Mar 3, 2026Updated 2 weeks ago
- ☆16Mar 24, 2025Updated 11 months ago
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer"☆90Apr 1, 2025Updated 11 months ago
- ☆40Dec 16, 2025Updated 3 months ago
- AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆96Nov 21, 2025Updated 4 months ago
- Lottie animations renderer using rlottie.☆12Jan 19, 2026Updated 2 months ago
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing☆23Aug 23, 2025Updated 6 months ago
- Developer project for getting basic API integrations working in under 5 minutes☆11Jan 30, 2026Updated last month
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆23Jul 30, 2025Updated 7 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆46Nov 25, 2025Updated 3 months ago
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆312Sep 28, 2025Updated 5 months ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- Official pytorch implementation of "Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use"☆20Sep 16, 2025Updated 6 months ago
- This repository is an official PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.☆13Mar 18, 2024Updated 2 years ago
- [ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati…☆35Jan 26, 2026Updated last month
- A PyTorch implementation of the "Image Inpainting for Irregular Holes Using Partial Convolutions" paper from Liu et al. at NVIDIA☆10Aug 24, 2019Updated 6 years ago
- [NeurIPS 2025] The official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tun…☆40Feb 20, 2025Updated last year
- PyTorch implementation of the paper "Region-Aware Portrait Retouching with Sparse Interactive Guidance" published in IEEE Transactions o…☆15Jun 14, 2023Updated 2 years ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- Gives you paid Windows 7 Extended Security Updates until January 2023, for free.☆16Apr 15, 2020Updated 5 years ago
- ☆10Sep 17, 2022Updated 3 years ago
- ☆99Feb 4, 2026Updated last month
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)☆11Nov 21, 2021Updated 4 years ago
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Dec 12, 2025Updated 3 months ago
- ☆11Nov 29, 2020Updated 5 years ago
- OFER: Occluded Face Expression Reconstruction. A 3D face reconstruction method producing diverse plausible expressive faces from a single…☆14Jan 9, 2026Updated 2 months ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆39Dec 22, 2025Updated 3 months ago
- Paper Statistics for CVPR'22☆14Jun 1, 2022Updated 3 years ago
- Fellou news - fellou.ai/blog☆22Oct 24, 2025Updated 4 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- Clean minimal implementation of Free-Form Image Inpainting with Gated Convolutions in pytorch lightning. Inspired from pytorch implementa…☆13Nov 22, 2022Updated 3 years ago
- ☆16May 13, 2024Updated last year
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆51Jan 30, 2026Updated last month
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆83Mar 9, 2026Updated last week
- ☆13Apr 23, 2025Updated 10 months ago