[AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong semantic and temporal consistency, integrating rich contextual conditions and enabling one-pass inference for enhanced coherence.
☆120Sep 30, 2025Updated 5 months ago
Alternatives and similar repositories for RCDMs
Users that are interested in RCDMs are comparing it to the libraries listed below
Sorting:
- [AAAI-2025] Official Codes for “Motif Guided Graph Transformer with Combinatorial Skeleton Prototype Learning for Skeleton-Based Person R…☆20Feb 26, 2025Updated last year
- [AAAI 2025] Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback☆31Dec 16, 2025Updated 2 months ago
- This is the official code implement for AAAI 2025 paper ``Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven Optimizat…☆22Mar 21, 2025Updated 11 months ago
- ✨ [AAAI 2025] Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification☆51Apr 16, 2025Updated 10 months ago
- [AAAI2025] Offical code implementation of "Context-aware Inductive Knowledge Graph Completion with Latent Type Constraints and Subgraph R…☆17Aug 26, 2025Updated 6 months ago
- [AAAI 2025] CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning☆26Apr 14, 2025Updated 10 months ago
- [AAAI 2025] RhythmMamba☆100Jul 29, 2025Updated 7 months ago
- [AAAI 2025] Official implementation of the paper "Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segm…☆40Dec 17, 2024Updated last year
- [AAAI'2025] The official implementation code of SIGMA☆39Oct 14, 2025Updated 4 months ago
- ☆24Dec 13, 2025Updated 2 months ago
- An official implementation of "Re-Attentional Controllable Video Diffusion Editing" in PyTorch. (AAAI 2025)☆27Dec 18, 2024Updated last year
- This repo is the official implementation of "Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal Learning" accepted by AA…☆59Feb 14, 2026Updated 2 weeks ago
- AAAI 2025: Hierarchical Consensus Network for Multiview Feature Learning☆17Feb 5, 2025Updated last year
- [2025] Language-driven Motion Prior Knowledge Learning for Moving Infrared Small Target Detection☆41Sep 26, 2025Updated 5 months ago
- Computer Science Conference Statistics: Explore number of submissions, acceptance rate, and many more.☆36Feb 22, 2026Updated last week
- 【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt☆84May 13, 2025Updated 9 months ago
- ☆40Jul 20, 2024Updated last year
- AAAI 2025 (Oral), BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities☆21Dec 1, 2025Updated 3 months ago
- [ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion☆40Jul 5, 2024Updated last year
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation☆68Sep 26, 2024Updated last year
- 【CVPR2025】IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification☆46Apr 8, 2025Updated 10 months ago
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)☆48Apr 10, 2025Updated 10 months ago
- ☆70Dec 18, 2024Updated last year
- Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirect…☆209May 9, 2025Updated 9 months ago
- [NeurIPS 2024] 🕺IMAGPose🕺: A Unified Conditional Framework for Pose-Guided Person Generation. IMAGPose enables versatile pose-guided im…☆350Sep 30, 2025Updated 5 months ago
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)☆97Jan 17, 2025Updated last year
- ☆17Jul 23, 2024Updated last year
- [ICCV 2025] MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation☆51Oct 14, 2025Updated 4 months ago
- [AAAI2025] Official implementation of the paper "RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Reso…☆18Mar 22, 2025Updated 11 months ago
- [ECCV24] Attention Regulation on T2I Diffusion Models☆19Jul 8, 2024Updated last year
- AugTarget data augmentation for infrared small target detection.☆20May 19, 2023Updated 2 years ago
- [CVPR 2025] Official PyTorch implementation of StoryGPT-V☆40Jun 14, 2025Updated 8 months ago
- Implementation code:Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models☆191Sep 30, 2025Updated 5 months ago
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…☆50Oct 23, 2023Updated 2 years ago
- [CVPR 2022 Workshop] Biometrics Workshop Pet Biometric Challenge TOP3☆58Jul 19, 2023Updated 2 years ago
- The official PyTorch implementation of "The 18th European Conference on Computer Vision" (ECCV 2024) paper Length-Aware Motion Synthesis …☆20Dec 15, 2024Updated last year
- ☆21Apr 17, 2024Updated last year
- [AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation …☆1,328Sep 30, 2025Updated 5 months ago
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video☆67May 8, 2025Updated 9 months ago