SpotEdit: Selective Region Editing in Diffusion Transformers
☆173Jan 5, 2026Updated 2 months ago
Alternatives and similar repositories for SpotEdit
Users who are interested in SpotEdit are comparing it to the repositories listed below
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆59Feb 22, 2026Updated last week
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 5 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆34Jan 16, 2026Updated last month
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit☆107Feb 21, 2026Updated last week
- OmniGAIA: Towards Native Omni-Modal AI Agents☆46Updated this week
- DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer☆546Jan 13, 2026Updated last month
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆69Updated this week
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆167Dec 11, 2025Updated 2 months ago
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆134Dec 18, 2025Updated 2 months ago
- Official implementation of Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model☆235Dec 8, 2025Updated 2 months ago
- Controlnet module for Wan2.2☆42Oct 30, 2025Updated 4 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- DreamStyle: A Unified Framework for Video Stylization☆109Jan 7, 2026Updated last month
- Official repository of paper "ProEdit: Inversion-based Editing From Prompts Done Right"☆113Feb 5, 2026Updated last month
- ☆39Oct 29, 2025Updated 4 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆130May 22, 2025Updated 9 months ago
- SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation☆585Dec 23, 2025Updated 2 months ago
- A CS221 final project: a dominoes AI☆11Dec 17, 2016Updated 9 years ago
- ☆34Jan 25, 2026Updated last month
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought☆27Feb 14, 2026Updated 2 weeks ago
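- Qwen3-TTS supports 10 major languages (Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, and Italian) as well as a variety of dialect voices☆75Jan 29, 2026Updated last month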
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping☆90Nov 30, 2025Updated 3 months ago
- Scaling Zero-Shot Reference-to-Video Generation☆62Dec 11, 2025Updated 2 months ago
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 6 months ago
- Overworld's local world client interface to run Waypoint world models☆46Updated this week
- Long-range camera-conditioned scene generation from one single image.☆105Dec 23, 2025Updated 2 months ago
- This repository is an official PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.☆13Mar 18, 2024Updated last year
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆38Feb 19, 2026Updated last week
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆307Dec 15, 2025Updated 2 months ago
- Official implementation of paper: "SwinTExCo: Exemplar-based Video Colorization using Swin Transformer"☆13Oct 6, 2024Updated last year
- PICABench: How Far Are We from Physically Realistic Image Editing?☆36Nov 5, 2025Updated 3 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆53Feb 4, 2026Updated last month
- Code2Worlds: Empowering Coding LLMs for 4D World Generation☆79Feb 26, 2026Updated last week
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".☆172Feb 4, 2026Updated last month
- lite attention implemented over flash attention 3☆45Updated this week
- Run your AI and CV algorithms in meetings such as Zoom, Meets or Teams! 🚀☆15Feb 18, 2024Updated 2 years ago
- A unified robotic manipulation learning framework☆21Sep 4, 2025Updated 6 months ago