PICABench: How Far Are We from Physically Realistic Image Editing?
☆38Nov 5, 2025Updated 7 months ago
Alternatives and similar repositories for PICABench
Users that are interested in PICABench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations☆22Dec 24, 2025Updated 6 months ago
- [AAAI 2024] Decoupled Textual Embeddings for Customized Image Generation☆30Feb 29, 2024Updated 2 years ago
- ☆17Mar 24, 2025Updated last year
- AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆98Nov 21, 2025Updated 7 months ago
- ☆47Dec 16, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing☆26Aug 23, 2025Updated 10 months ago
- Developer project for getting basic API integrations working in under 5 minutes☆11May 22, 2026Updated last month
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆26Jul 30, 2025Updated 11 months ago
- [CVPR 2026] UnicEdit-10M and UnicBench project☆41Mar 3, 2026Updated 3 months ago
- [CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO☆120Feb 28, 2026Updated 4 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆45Nov 25, 2025Updated 7 months ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆315Sep 28, 2025Updated 9 months ago
- This repository is an offical PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.☆13Mar 18, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official pytorch implementation of "Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use"☆20Sep 16, 2025Updated 9 months ago
- [ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati…☆39Jan 26, 2026Updated 5 months ago
- Official Codebase for CVPR 2026 Highlight Paper: "Building a Precise Video Language with Human–AI Oversight"☆134May 13, 2026Updated last month
- A PyTorch implementation of the "Image Inpainting for Irregular Holes Using Partial Convolutions" paper from Liu et al at NVIDIA☆11Aug 24, 2019Updated 6 years ago
- [NeurIPS 2025] The official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tun…☆40Feb 20, 2025Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)☆11Nov 21, 2021Updated 4 years ago
- ☆104Feb 4, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆26Dec 12, 2025Updated 6 months ago
- ☆11Nov 29, 2020Updated 5 years ago
- OFER: Occluded Face Expression Reconstruction. A 3D face reconstruction method producing diverse plausible expressive faces from a single…☆14Jan 9, 2026Updated 5 months ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆42Dec 22, 2025Updated 6 months ago
- Official implementation of TC-Padé (CVPR 2026)☆30May 11, 2026Updated last month
- Paper Statistics for CVPR‘22☆14Jun 1, 2022Updated 4 years ago
- [ICML 2026][Ultra Powerful Few-Step Diffusion RL] TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward☆107May 25, 2026Updated last month
- ☆16May 13, 2024Updated 2 years ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Clean minimal implementation of Free-Form Image Inpainting with Gated Convolutions in pytorch lightning. Inspired from pytorch implementa…☆13Nov 22, 2022Updated 3 years ago
- InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥☆37May 11, 2024Updated 2 years ago
- [ICCV 2025] Official Implementation of Steering Rectified Flow Models in the Vector Field for Controlled Image Generation☆46Jun 27, 2025Updated last year
- Working note for WSI analysis☆10Apr 3, 2023Updated 3 years ago
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆99Mar 9, 2026Updated 3 months ago
- ☆11Jul 3, 2019Updated 6 years ago
- Official implementation of "ScoreNet: Learning Non-Uniform Attention and Augmentation for Transformer-Based Histopathological Image Class…☆12Mar 6, 2023Updated 3 years ago