SpotEdit: Selective Region Editing in Diffusion Transformers
☆196Jan 5, 2026Updated 6 months ago
Alternatives and similar repositories for SpotEdit
Users who are interested in SpotEdit are comparing it to the libraries listed below.
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆67Feb 22, 2026Updated 4 months ago
- (ECCV2026) RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆65Updated this week
- [ECCV 2026] DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer☆653May 22, 2026Updated last month
- We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enablin…☆75Jun 22, 2026Updated last week
- Long-range camera-conditioned scene generation from one single image.☆111Dec 23, 2025Updated 6 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆75Feb 26, 2026Updated 4 months ago
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit☆152Apr 5, 2026Updated 2 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆134May 22, 2025Updated last year
- The official repository of paper "Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion"☆307Jan 12, 2026Updated 5 months ago
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆66Jun 25, 2026Updated last week
- Controlnet module for Wan2.2☆45Oct 30, 2025Updated 8 months ago
- Official implementation of "Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model"☆271Apr 25, 2026Updated 2 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆38Apr 25, 2026Updated 2 months ago
- [Arxiv 2025] Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder…☆152Dec 18, 2025Updated 6 months ago
- SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation☆633Dec 23, 2025Updated 6 months ago
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆76Apr 28, 2026Updated 2 months ago
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆189Dec 11, 2025Updated 6 months ago
- Official repository of paper "ProEdit: Inversion-based Editing From Prompts Done Right"☆116Feb 5, 2026Updated 4 months ago
- ☆104Nov 17, 2025Updated 7 months ago
- This repository is an official PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.☆13Mar 18, 2024Updated 2 years ago
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]☆17Jun 1, 2026Updated last month
- [Roadmap] Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling☆121Jun 9, 2026Updated 3 weeks ago
- DreamStyle: A Unified Framework for Video Stylization☆122Jan 7, 2026Updated 5 months ago
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms☆50Mar 25, 2026Updated 3 months ago
- Accepted by ICML2026☆89Updated this week
- ☆13Mar 8, 2024Updated 2 years ago
- The official code of Yume☆675Jan 14, 2026Updated 5 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- [CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO☆120Feb 28, 2026Updated 4 months ago
- ☆83Oct 13, 2025Updated 8 months ago
- PICABench: How Far Are We from Physically Realistic Image Editing?☆38Nov 5, 2025Updated 7 months ago
- OmniGAIA: Towards Native Omni-Modal AI Agents☆134Apr 2, 2026Updated 3 months ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆334Dec 15, 2025Updated 6 months ago
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping☆96Nov 30, 2025Updated 7 months ago
- Jittor Challenge, skeleton rigging track☆15Oct 9, 2025Updated 8 months ago
- ☆29Apr 30, 2024Updated 2 years ago
- Official Implementation of SAGE-GRPO: Manifold-Aware Exploration for Reinforcement Learning in Video Generation☆125Apr 2, 2026Updated 3 months ago
- [AAAI 2026] This is the official implementation of "T-LoRA: Single Image Diffusion Model Customization Without Overfitting"☆149Apr 24, 2026Updated 2 months ago
- Official Pytorch Implementation for "Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising"☆367May 12, 2026Updated last month