Consistent Human Image and Video Generation with Spatially Conditioned Diffusion
☆15Sep 1, 2025Updated 6 months ago
Alternatives and similar repositories for SCD
Users that are interested in SCD are comparing it to the libraries listed below
Sorting:
- [AAAI 2024] PoseGen: Learning to Generate 3D Human Pose Datasets with NeRF☆10Dec 29, 2023Updated 2 years ago
- ☆13Jul 10, 2024Updated last year
- [ICASSP 2022] Official PyTorch Implementation for "Attention Probe: Vision Transformer Distillation in the Wild" (ICASSP 2022)☆11Jan 23, 2022Updated 4 years ago
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆26Dec 12, 2024Updated last year
- ☆15Jan 8, 2024Updated 2 years ago
- ☆16Feb 23, 2025Updated last year
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆21Jul 26, 2025Updated 7 months ago
- ☆17Feb 13, 2024Updated 2 years ago
- Code of StyleCrafter on SDXL☆20Jun 25, 2024Updated last year
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models☆91Sep 12, 2025Updated 5 months ago
- ☆27Mar 3, 2025Updated 11 months ago
- Rolling Shutter Correction with Intermediate Distortion Flow Estimation☆23Nov 27, 2024Updated last year
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆39Oct 16, 2025Updated 4 months ago
- [ECCV 2024] IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation☆56Sep 16, 2024Updated last year
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆22Jul 21, 2024Updated last year
- PDM-based Purifier☆22Nov 5, 2024Updated last year
- ☆62Jun 25, 2024Updated last year
- (ICCV2025) ToMiE: Towards Explicit Exoskeleton for the Reconstruction of Complicated 3D Human Avatars☆42Jan 19, 2026Updated last month
- [CVPR 2022] Learning Adaptive Warping for Real-World Rolling Shutter Correction☆32May 31, 2024Updated last year
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"☆37Jul 23, 2025Updated 7 months ago
- Code for "HumanGif: Single-View Human Diffusion with Generative Prior"☆31Jun 29, 2025Updated 8 months ago
- [NeurIPS 2023] Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment☆37Oct 11, 2023Updated 2 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- [CVPR'25 Highlight] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis☆158Apr 15, 2025Updated 10 months ago
- ☆20Sep 5, 2025Updated 5 months ago
- [AAAI'25] Official implementation of Image Conductor: Precision Control for Interactive Video Synthesis☆101Jul 18, 2024Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- [CVPR 2023] Visibility Constrained Wide-band Illumination Spectrum Design for Seeing-in-the-Dark☆48Aug 4, 2023Updated 2 years ago
- [ECCV 2024 Oral] Code for RPBG: Towards Robust Neural Point-based Graphics in the Wild.☆44Aug 22, 2024Updated last year
- [CVPR 2025] Official code for the paper "SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis"☆135Mar 18, 2025Updated 11 months ago
- ☆11Aug 12, 2024Updated last year
- The official repository of the paper "X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation"☆12Jan 22, 2025Updated last year
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆52Dec 5, 2024Updated last year
- Investigating and Defending Shortcut Learning in Personalized Diffusion Models☆13Nov 19, 2024Updated last year
- ☆12Nov 2, 2025Updated 3 months ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- ☆16Oct 13, 2025Updated 4 months ago
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics☆22Nov 18, 2025Updated 3 months ago
- [NeurIPS2022] Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop☆14Apr 13, 2023Updated 2 years ago