ChocoWu / Any2CaptionLinks
This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation
☆46Updated 8 months ago
Alternatives and similar repositories for Any2Caption
Users that are interested in Any2Caption are comparing it to the libraries listed below
Sorting:
- ☆29Updated 8 months ago
- [AAAI'25] Official implementation of Image Conductor: Precision Control for Interactive Video Synthesis☆99Updated last year
- Official implementation of "Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation" (ICCV 2…☆77Updated 3 months ago
- [ICCV 2025] Balanced Image Stylization with Style Matching Score☆65Updated 2 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆63Updated 6 months ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆75Updated 4 months ago
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset☆94Updated last week
- ☆50Updated 2 months ago
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers☆127Updated 5 months ago
- [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing☆23Updated 2 weeks ago
- [ACM MM24] MotionMaster: Training-free Camera Motion Transfer For Video Generation☆97Updated last year
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆108Updated 2 months ago
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.☆115Updated last year
- Implementation Code for Omni-Effects☆159Updated 2 months ago
- Blending Custom Photos with Video Diffusion Transformers☆48Updated 10 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆87Updated 6 months ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆138Updated last year
- [ICCV 2025] MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance☆169Updated last month
- [CVPR'25 Highlight] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis☆159Updated 7 months ago
- [CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆155Updated 2 weeks ago
- ☆47Updated 7 months ago
- ☆90Updated last year
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆73Updated 11 months ago
- [AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation☆75Updated 5 months ago
- [[NeurIPS 2025] UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions☆69Updated 4 months ago
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models☆88Updated last year
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆106Updated last year
- [ICCV 2025] Edicho: Consistent Image Editing in the Wild☆122Updated last month
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback☆171Updated last week
- HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.☆126Updated 4 months ago