ChocoWu / Any2Caption

This is the project for "Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation".

☆50

Alternatives and similar repositories for Any2Caption

Users who are interested in Any2Caption compare it to the repositories listed below.

KlingTeam / SVG-T2I
Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".
☆70 Updated last week
kinam0252 / TIC-FT
☆51 Updated last week
vfx-creator0 / VFXCreator
☆29 Updated 9 months ago
JaydenLyh / Reward-Forcing
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
☆193 Updated last week
jianzongwu / MotionBooth
[NeurIPS 2024 Spotlight] The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"
☆138 Updated last year
desaixie / pa_vdm
CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151
☆87 Updated 7 months ago
Phantom-video / Phantom-Data
Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset
☆99 Updated last month
quanhaol / MagicMotion
[ICCV 2025] MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
☆171 Updated last month
knightyxp / VideoCoF
Unified Video Editing with Temporal Reasoner
☆86 Updated last week
ditflow / ditflow
Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers
☆76 Updated 4 months ago
KaiyueSun98 / T2I-Personalization-with-AR
☆47 Updated 8 months ago
justincui03 / Self-Forcing-Plus-Plus
Official Repo for Self-Forcing++ High Quality Long Video Generation
☆211 Updated 2 months ago
liyaowei-stu / ImageConductor
[AAAI'25] Official implementation of Image Conductor: Precision Control for Interactive Video Synthesis
☆100 Updated last year
chen-yingjie / Perception-as-Control
Official implementation of "Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation" (ICCV 2…
☆77 Updated 4 months ago
ant-research / LeviTor
[CVPR'25 Highlight] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
☆158 Updated 8 months ago
arthur-qiu / FreeTraj
Code for FreeTraj, a tuning-free method for trajectory-controllable video generation
☆108 Updated 3 months ago
Bujiazi / HiFlow
[NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance
☆84 Updated 3 months ago
dvlab-research / MagicMirror
[ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers
☆127 Updated 5 months ago
wz0919 / DreamRunner
[AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation
☆76 Updated 6 months ago
ML-GSAI / Concat-ID
Concat-ID: Towards Universal Identity-Preserving Video Synthesis
☆65 Updated 7 months ago
PKU-YuanGroup / Edit-R1
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
☆192 Updated last week
sjtuplayer / MotionMaster
[ACM MM24] MotionMaster: Training-free Camera Motion Transfer For Video Generation
☆97 Updated last year
KlingTeam / StyleMaster
[CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation
☆161 Updated last month
TencentARC / BlobCtrl
[SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing
☆25 Updated last month
EnVision-Research / MotionInversion
[SIGGRAPH 2025] Official implementation of 'Motion Inversion For Video Customization'
☆152 Updated last year
AMAP-ML / Omni-Effects
Implementation Code for Omni-Effects
☆163 Updated 2 weeks ago
feizc / Ingredients
Blending Custom Photos with Video Diffusion Transformers
☆48 Updated 11 months ago
Kmcode1 / SG-I2V
This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.
☆115 Updated last year
wutong16 / FiVA
[NeurIPS 2024 D&B Track] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"
☆73 Updated 11 months ago
vivoCameraResearch / Hyper-Motion
HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.
☆128 Updated 5 months ago