[AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong semantic and temporal consistency, integrating rich contextual conditions and enabling one-pass inference for enhanced coherence.
☆68Sep 30, 2025Updated 7 months ago
Alternatives and similar repositories for RCDMs
Users that are interested in RCDMs are comparing it to the libraries listed below.
- [AAAI 2025] Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback☆34Dec 16, 2025Updated 5 months ago
- Source code for AAAI'25 paper "Component-Level Segmentation for Oracle Bone Inscription Decipherment"☆20Oct 13, 2025Updated 7 months ago
- ✨ [AAAI 2025] Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification☆53Apr 16, 2025Updated last year
- [AAAI2025] Official code implementation of "Context-aware Inductive Knowledge Graph Completion with Latent Type Constraints and Subgraph R…☆17Aug 26, 2025Updated 8 months ago
- [AAAI 2025] CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning☆28Apr 14, 2025Updated last year
- [AAAI 2025] RhythmMamba☆107Jul 29, 2025Updated 9 months ago
- [AAAI 2025] Official implementation of the paper "Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segm…☆41Dec 17, 2024Updated last year
- AAAI '25. Retrieval-Augmented Multimodal Social Media Popularity Prediction☆23Apr 27, 2026Updated 3 weeks ago
- [AAAI'2025] The official implementation code of SIGMA☆41Oct 14, 2025Updated 7 months ago
- [AAAI 2025] Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video☆115Jun 14, 2025Updated 11 months ago
- ☆26Mar 16, 2026Updated 2 months ago
- An official implementation of "Re-Attentional Controllable Video Diffusion Editing" in PyTorch. (AAAI 2025)☆27Dec 18, 2024Updated last year
- ☆71Dec 18, 2024Updated last year
- [2025] Language-driven Motion Prior Knowledge Learning for Moving Infrared Small Target Detection☆46Sep 26, 2025Updated 7 months ago
- 【CVPR2025】IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification☆47Apr 8, 2025Updated last year
- 【AAAI2025】DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification☆72Mar 9, 2025Updated last year
- 【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt☆87May 13, 2025Updated last year
- AAAI 2025: Autonomous LLM-enhanced adversarial attack for text-to-motion☆19Sep 15, 2025Updated 8 months ago
- ☆58Jun 14, 2023Updated 2 years ago
- Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirect…☆218May 9, 2025Updated last year
- [NeurIPS 2024] 🕺IMAGPose🕺: A Unified Conditional Framework for Pose-Guided Person Generation. IMAGPose enables versatile pose-guided im…☆190Sep 30, 2025Updated 7 months ago
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation☆68Sep 26, 2024Updated last year
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)☆47Apr 10, 2025Updated last year
- [ICCV 2025] MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation☆52Oct 14, 2025Updated 7 months ago
- [ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion☆40Jul 5, 2024Updated last year
- Implementation code: Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models☆192Sep 30, 2025Updated 7 months ago
- [NeurIPS 2024] MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models☆96Jan 17, 2025Updated last year
- [AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation …☆1,337Sep 30, 2025Updated 7 months ago
- [ECCV24] Attention Regulation on T2I Diffusion Models☆19Jul 8, 2024Updated last year
- [CVPR 2025] Official PyTorch implementation of StoryGPT-V☆41Jun 14, 2025Updated 11 months ago
- Full-stack learning☆50Mar 16, 2025Updated last year
- [AAAI2025] Official implementation of the paper "RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Reso…☆18Mar 22, 2025Updated last year
- ☆17Jul 23, 2024Updated last year
- [ICLR 2023] Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models☆50Jan 17, 2024Updated 2 years ago
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…☆50Oct 23, 2023Updated 2 years ago
- Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton based Action Recognition☆11Aug 30, 2021Updated 4 years ago
- Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models☆202Jul 9, 2023Updated 2 years ago
- ☆22Apr 17, 2024Updated 2 years ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 7 months ago