[ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models
☆92Sep 11, 2025Updated 6 months ago
Alternatives and similar repositories for CoMPaSS
Users that are interested in CoMPaSS are comparing it to the libraries listed below
Sorting:
- Resilient multi-LLM orchestration with in-built failure handing, rate limits, retries, and circuit breaker.☆30Updated this week
- [3DV 2026 Oral] VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space☆218Nov 25, 2025Updated 3 months ago
- ☆86Feb 4, 2026Updated last month
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit☆115Mar 13, 2026Updated last week
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 7 months ago
- A repository of Python & PyTorch scripts which (currently) converts .safetensors models into scaled FP8 variants, utilizing gradient desc…☆27Aug 8, 2025Updated 7 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026)☆45Mar 3, 2026Updated 2 weeks ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆41Nov 20, 2025Updated 4 months ago
- [CVPR2026 🎉] Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.☆744Feb 21, 2026Updated last month
- Pose Extraction & Rendering for SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representat…☆185Dec 28, 2025Updated 2 months ago
- [inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆13Apr 22, 2024Updated last year
- ☆187Jul 31, 2025Updated 7 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆422Aug 26, 2025Updated 6 months ago
- [CVPR 2026] OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer☆225Feb 21, 2026Updated last month
- End2End Virtual Try-on with Visual Reference, CVPR2026☆58Updated this week
- Feed-forward model for predicting 3D physics with 3DGS + NeRF☆283Mar 5, 2026Updated 2 weeks ago
- [SIGGRAGH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation☆88Aug 18, 2025Updated 7 months ago
- Great Nodes for ComfyUI☆13Dec 4, 2025Updated 3 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆164Jun 26, 2025Updated 8 months ago
- [CVPR 2026] 🔥🔥 Official Repo of USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning☆1,215Sep 12, 2025Updated 6 months ago
- A quick port of Resynthesizer (the Gimp plug-in for content aware fill) to ComfyUI.☆30Jul 25, 2025Updated 7 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".☆173Feb 4, 2026Updated last month
- Pusa: Thousands Timesteps Video Diffusion Model☆674Feb 13, 2026Updated last month
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆134Nov 27, 2025Updated 3 months ago
- HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.☆102Mar 15, 2026Updated last week
- [AAAI 2026] UltraGen☆77Feb 1, 2026Updated last month
- Official implementation of "Normalized Attention Guidance"☆186Jul 1, 2025Updated 8 months ago
- Official PyTorch implementation of the paper "FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing"☆80Dec 12, 2025Updated 3 months ago
- Following the advance of AIGC☆23Oct 28, 2025Updated 4 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video☆50Updated this week
- ☆35Aug 31, 2025Updated 6 months ago
- Official Implementation of DRA-Ctrl (Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis)☆118Aug 15, 2025Updated 7 months ago
- ComfyUI-HiggsAudio is now available in ComfyUI, Higgs Audio v2 is a text-audio foundation model from Boson AI.☆23Jul 26, 2025Updated 7 months ago
- The official pytorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”.☆19May 22, 2025Updated 9 months ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆242Aug 22, 2025Updated 7 months ago
- TBG Enhanced Tiled Upscaler and Refiner upscales up to 200MP with precise control. It features dual-model processing (structure + detail)…☆123Mar 4, 2026Updated 2 weeks ago
- [ICLR 2026] Streamlining Cartoon Production with Generative Post-Keyframing☆558Aug 20, 2025Updated 7 months ago
- CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework" 🔥☆125Feb 22, 2026Updated last month
- [ICCV 2025] Edicho: Consistent Image Editing in the Wild☆124Oct 22, 2025Updated 4 months ago