ChocoWu / Any2CaptionLinks
This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation
☆50Updated 8 months ago
Alternatives and similar repositories for Any2Caption
Users that are interested in Any2Caption are comparing it to the libraries listed below
Sorting:
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆70Updated last week
- ☆51Updated last week
- ☆29Updated 9 months ago
- Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation☆193Updated last week
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆138Updated last year
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆87Updated 7 months ago
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset☆99Updated last month
- [ICCV 2025] MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance☆171Updated last month
- Unified Video Editing with Temporal Reasoner☆86Updated last week
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆76Updated 4 months ago
- ☆47Updated 8 months ago
- Official Repo for Self-Forcing++ High Quality Long Video Generation☆211Updated 2 months ago
- [AAAI'25] Official implementation of Image Conductor: Precision Control for Interactive Video Synthesis☆100Updated last year
- Official implementation of "Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation" (ICCV 2…☆77Updated 4 months ago
- [CVPR'25 Highlight] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis☆158Updated 8 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆108Updated 3 months ago
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆84Updated 3 months ago
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers☆127Updated 5 months ago
- [AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation☆76Updated 6 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆65Updated 7 months ago
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback☆192Updated last week
- [ACM MM24] MotionMaster: Training-free Camera Motion Transfer For Video Generation☆97Updated last year
- [CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆161Updated last month
- [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing☆25Updated last month
- [SIGGRAPH 2025] Official implementation of 'Motion Inversion For Video Customization'☆152Updated last year
- Implementation Code for Omni-Effects☆163Updated 2 weeks ago
- Blending Custom Photos with Video Diffusion Transformers☆48Updated 11 months ago
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.☆115Updated last year
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆73Updated 11 months ago
- HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.☆128Updated 5 months ago