ChocoWu / Any2Caption
This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation
☆26Updated last week
Alternatives and similar repositories for Any2Caption:
Users that are interested in Any2Caption are comparing it to the libraries listed below
- [CVPR 2025] Official implementation of the paper "Generative Inbetweening through Frame-wise Conditions-Driven Video Generation"☆88Updated last month
- [CVPR'25] Official PyTorch implementation of AvatarArtist: Open-Domain 4D Avatarization.☆39Updated last week
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆23Updated 8 months ago
- [ECCV 2024] HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance☆45Updated 6 months ago
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆25Updated 4 months ago
- ObjCtrl-2.5D☆43Updated last week
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.☆100Updated 4 months ago
- ☆20Updated 2 weeks ago
- ☆83Updated 7 months ago
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation☆86Updated 2 weeks ago
- Blending Custom Photos with Video Diffusion Transformers☆47Updated 2 months ago
- ☆23Updated 2 weeks ago
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.☆96Updated 4 months ago
- ☆67Updated 10 months ago
- Balanced Image Stylization with Style Matching Score☆28Updated last week
- [CVPR 2025 Highlight] Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis☆94Updated last week
- Official implementation of "Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices" (ICML 202…☆55Updated 4 months ago
- [ICLR 2024] Code for FreeNoise based on AnimateDiff☆107Updated last year
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆98Updated last year
- [ACM MM24] MotionMaster: Training-free Camera Motion Transfer For Video Generation☆90Updated 5 months ago
- ☆42Updated 8 months ago
- ☆37Updated 10 months ago
- [ECCV 2024] IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation☆54Updated 6 months ago
- Pusa: Thousands-handed Video Diffusion Model☆11Updated 2 weeks ago
- [ARXIV'24] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆100Updated 2 weeks ago
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning☆81Updated 4 months ago
- [AAAI-2025] Official implementation of Image Conductor: Precision Control for Interactive Video Synthesis☆90Updated 8 months ago
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"☆39Updated 7 months ago
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer"☆45Updated last week
- Official Implementation of paper "Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models"☆52Updated 2 months ago