muzishen/RCDMs

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/muzishen/RCDMs)

muzishen / RCDMs

[AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong semantic and temporal consistency, integrating rich contextual conditions and enabling one-pass inference for enhanced coherence.

☆70

Alternatives and similar repositories for RCDMs

Users that are interested in RCDMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Xovee / skapp
View on GitHub
AAAI '25. Retrieval-Augmented Multimodal Social Media Popularity Prediction
☆24Jul 8, 2026Updated 2 weeks ago
RaynorLEE / CATS
View on GitHub
[AAAI2025] Offical code implementation of "Context-aware Inductive Knowledge Graph Completion with Latent Type Constraints and Subgraph R…
☆17Aug 26, 2025Updated 10 months ago
fengxueguiren / CoPEFT
View on GitHub
[AAAI 2025] CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning
☆28Apr 14, 2025Updated last year
can-can-ya / QPMIL-VL
View on GitHub
✨ [AAAI 2025] Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification
☆54Apr 16, 2025Updated last year
zizheng-guo / RhythmMamba
View on GitHub
[AAAI 2025] RhythmMamba
☆110Jul 29, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ziwliu8 / SIGMA
View on GitHub
[AAAI'2025] The official implementation code of SIGMA
☆41Oct 14, 2025Updated 9 months ago
mdswyz / ReAtCo
View on GitHub
An official implementation of "Re-Attentional Controllable Video Diffusion Editing" in PyTorch. (AAAI 2025)
☆27Dec 18, 2024Updated last year
Xovee / cs-conf-stats
View on GitHub
Computer Science Conference Statistics: Explore number of submissions, acceptance rate, and many more.
☆44Jul 3, 2026Updated 3 weeks ago
tangtaogo / alignmif
View on GitHub
☆40Jul 20, 2024Updated 2 years ago
dirtycomputer / O2M_attack
View on GitHub
☆71Dec 18, 2024Updated last year
kunzhan / HCN
View on GitHub
AAAI 2025: Hierarchical Consensus Network for Multiview Feature Learning
☆18Feb 5, 2025Updated last year
924973292 / DeMo
View on GitHub
【AAAI2025】DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification
☆76Mar 9, 2025Updated last year
yangbincv / ADCA
View on GitHub
☆59Jun 14, 2023Updated 3 years ago
kunzhan / BrainGuard
View on GitHub
AAAI 2025 (Oral), BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
☆23Dec 1, 2025Updated 7 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Alibaba-VELLDEPTH / AttentiveEraser
View on GitHub
Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirect…
☆222Jun 16, 2026Updated last month
muzishen / IMAGPose
View on GitHub
[NeurIPS 2024] 🕺IMAGPose🕺: A Unified Conditional Framework for Pose-Guided Person Generation. IMAGPose enables versatile pose-guided im…
☆188Sep 30, 2025Updated 9 months ago
UESTC-nnLab / MoPKL
View on GitHub
[2025] Language-driven Motion Prior Knowledge Learning for Moving Infrared Small Target Detection
☆49Jun 17, 2026Updated last month
LingjieKong-fdu / CustAny
View on GitHub
Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)
☆47Apr 10, 2025Updated last year
Jian-Lang / RAGPT
View on GitHub
This repo is the official implementation of "Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal Learning" accepted by AA…
☆66May 26, 2026Updated last month
donahowe / TheaterGen
View on GitHub
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
☆69Sep 26, 2024Updated last year
tobran / StoryImager
View on GitHub
[ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
☆40Jul 5, 2024Updated 2 years ago
kriskrisliu / PAT
View on GitHub
[AAAI 2025] PAT: Pruning-Aware Tuning for Large Language Models
☆37Feb 1, 2025Updated last year
tencent-ailab / PCDMs
View on GitHub
Implementation code：Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
☆192Sep 30, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
agwmon / MuDI
View on GitHub
[NeurIPS 2024] MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models
☆96Jan 17, 2025Updated last year
muzishen / IMAGDressing
View on GitHub
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation …
☆1,342Sep 30, 2025Updated 9 months ago
Shengjia-C / AugTarget
View on GitHub
AugTarget data augmentation for infrared small target detection.
☆20May 19, 2023Updated 3 years ago
YaNgZhAnG-V5 / attention_regulation
View on GitHub
[ECCV24] Attention Regulation on T2I Diffusion Models
☆19Jul 8, 2024Updated 2 years ago
muzishen / Deepsort_V2
View on GitHub
2020中兴捧月阿尔法赛道多目标检测和跟踪初赛第一名方案
☆33Oct 3, 2023Updated 2 years ago
xiaoqian-shen / StoryGPT-V
View on GitHub
[CVPR 2025] Official PyTorch implementation of StoryGPT-V
☆42Jun 14, 2025Updated last year
UESTC-nnLab / SSTNet
View on GitHub
[TGRS 2024] SSTNet: Sliced spatio-temporal network with cross-slice ConvLSTM for moving infrared dim-small target detection
☆68Jan 21, 2025Updated last year
littlepure2333 / GFlow
View on GitHub
[AAAI 2025] GFlow: Recovering 4D World from Monocular Video
☆72May 8, 2025Updated last year
W-JG / RAP-SR
View on GitHub
[AAAI2025] Official implementation of the paper "RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Reso…
☆17Mar 22, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
muzishen / VIPriors-Object-Detection-Challenge
View on GitHub
[ECCV 2020 Workshop] VIPirios Object Detection Champion
☆44Jul 10, 2023Updated 3 years ago
tobran / ONE-PIC
View on GitHub
☆17Jul 23, 2024Updated 2 years ago
tangtaogo / lidar-nerf
View on GitHub
LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields
☆163Jul 17, 2023Updated 3 years ago
muzishen / PCDMs
View on GitHub
[ICLR 2023] Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
☆50Jan 17, 2024Updated 2 years ago
sail-sg / ScaleLong
View on GitHub
The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…
☆50Oct 23, 2023Updated 2 years ago
opendatalab / VHM
View on GitHub
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
☆122Mar 25, 2026Updated 4 months ago
zixiangzhou916 / UDE-2
View on GitHub
☆22Apr 17, 2024Updated 2 years ago