[Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing
☆208Apr 13, 2026Updated last month
Alternatives and similar repositories for SpatialEdit
Users that are interested in SpatialEdit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [arXiv 2512.17796] Animate Any Character in Any World☆96Mar 10, 2026Updated 2 months ago
- [AAAI 2026] UltraGen☆78Feb 1, 2026Updated 4 months ago
- ☆77Mar 30, 2026Updated 2 months ago
- [ICML2026] From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors☆92Apr 30, 2026Updated last month
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆74Apr 28, 2026Updated last month
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping☆95Nov 30, 2025Updated 6 months ago
- [SiggraphAsia25] OmnimatteZero: Fast Training-free Omnimatte with Pre-trained Video Diffusion Models☆224Feb 16, 2026Updated 3 months ago
- [SIGGRAPH Asia 2025] Official github repo of SeqTex, an end-to-end 3D texture generation method using video diffusion priors.☆46Dec 12, 2025Updated 5 months ago
- Consistent Autoregressive Video Generation with Long Context☆88Feb 6, 2026Updated 4 months ago
- [ICLR2026] The first W4A4KV4 quantized + 50% sparse LLMs!☆32Jan 26, 2026Updated 4 months ago
- [CVPR2026]We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length vide…☆473Feb 21, 2026Updated 3 months ago
- Official repo for paper "SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation".☆56Mar 22, 2026Updated 2 months ago
- Complete Object Removal via Object-Effect Attention,you can try it in ComfyUI☆29Nov 24, 2025Updated 6 months ago
- [CVPR 2026] WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories (WorldExpand of HY-Wo…☆164Apr 24, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆110Sep 3, 2025Updated 9 months ago
- [ICML 2026] DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation☆267May 22, 2026Updated 2 weeks ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆75Feb 26, 2026Updated 3 months ago
- [CVPR 2026] Offical implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Pre…☆90May 11, 2026Updated 3 weeks ago
- [ICML 2026] Pytorch implementation of Self-Refining Video Sampling☆174May 1, 2026Updated last month
- [CVPR 2026 Highlight & Best Paper of VideoWorldModel Workshop] NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos☆601May 12, 2026Updated 3 weeks ago
- 3D In-the-Wild Human Dataset Generation with Diffusion Models☆48Apr 3, 2024Updated 2 years ago
- VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control☆395Feb 26, 2026Updated 3 months ago
- Official Codebase for our CVPR 2026 paper UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass☆146Feb 24, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ICLR 26] Part-X-MLLM: Part-aware 3D Multimodal Large Language Model☆117Jan 26, 2026Updated 4 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".☆178Feb 4, 2026Updated 4 months ago
- Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation☆703May 19, 2026Updated 2 weeks ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆327Dec 15, 2025Updated 5 months ago
- Implementation of Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players☆591May 28, 2026Updated last week
- [CVPR 2026 Highlight] Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision☆152Apr 16, 2026Updated last month
- [NeurIPS'25] OSCAR: One-Step Diffusion Codec Across Multiple Bit-rates☆39Oct 20, 2025Updated 7 months ago
- Official Implementation of MultiWorld: Scalable Multi-Agent Multi-View Video World Models☆223May 12, 2026Updated 3 weeks ago
- Official Implementation of "Instance Segmentation of Scene Sketches Using Natural Image Priors" (SIGGRAPH 2025)☆92Sep 10, 2025Updated 8 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Echo-TTS inference codebase☆193Dec 5, 2025Updated 6 months ago
- ☆10Sep 24, 2024Updated last year
- Code of the paper "FreePCA:Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…☆27Apr 3, 2026Updated 2 months ago
- ☆19Jun 2, 2026Updated last week
- ☆89May 13, 2026Updated 3 weeks ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆83Mar 3, 2026Updated 3 months ago
- Offical implementation of "Auto-Regressively Generating Multi-View Consistent Images". (ICCV 2025)☆88Jul 26, 2025Updated 10 months ago