Benchmark dataset and code of MSRVTT-Personalization
☆51Nov 10, 2025Updated 4 months ago
Alternatives and similar repositories for MSRVTT-Personalization
Users that are interested in MSRVTT-Personalization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling☆78Mar 2, 2026Updated 3 weeks ago
- Official implementation for BMVC 2021 paper Render In-between: Motion Guided Video Synthesis for Action Interpolation☆16Dec 23, 2021Updated 4 years ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆66May 7, 2025Updated 10 months ago
- This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation☆49Apr 3, 2025Updated 11 months ago
- Code of the paper "FreePCA:Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…☆29Aug 26, 2025Updated 6 months ago
- ☆14Jun 2, 2025Updated 9 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆439Sep 24, 2025Updated 6 months ago
- ☆22Feb 13, 2026Updated last month
- Code for NeurIPS 2024 work "MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps"☆17Dec 11, 2024Updated last year
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- VideoAuteur: Towards Long Narrative Video Generation☆43Oct 22, 2025Updated 5 months ago
- [CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models☆52Feb 21, 2026Updated last month
- The Third Place Winner in Generative Track of the ECCV 2024 DD Challenge☆10Oct 11, 2024Updated last year
- ☆52Jan 6, 2026Updated 2 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆90May 12, 2025Updated 10 months ago
- Official PyTorch Implementation of “VLScene: Vision-Language Guidance Distillation for Camera-based 3D Semantic Scene Completion”(AAAI 20…☆14Oct 13, 2025Updated 5 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 7 months ago
- Awesome-Text2Motion-Generation☆18Oct 26, 2023Updated 2 years ago
- Finetuning and inference tools for the CogView4 and CogVideoX model series.☆118May 14, 2025Updated 10 months ago
- This is a LoRA model finetuned on Wan-I2V-14B-480P. It turns things in the image into fluffy toys.☆19Nov 10, 2025Updated 4 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement (ICLR2026)☆291Mar 12, 2026Updated last week
- Official implementations for paper: PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention☆19Oct 20, 2025Updated 5 months ago
- My implement of InstantBooth☆13Sep 11, 2023Updated 2 years ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆56Aug 16, 2025Updated 7 months ago
- ☆30Mar 4, 2025Updated last year
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆35Jun 12, 2025Updated 9 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆636Oct 29, 2025Updated 4 months ago
- [ICCV 2023 Oral] Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction☆110Apr 11, 2025Updated 11 months ago
- PyTorch code for EgoHMR (ICCV 2023): Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views☆74Jun 8, 2025Updated 9 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆676Oct 25, 2024Updated last year
- ☆13Feb 28, 2025Updated last year
- ☆132Dec 19, 2025Updated 3 months ago
- SkyReels-A2: Compose anything in video diffusion transformers☆706Jun 3, 2025Updated 9 months ago
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers☆128Jun 26, 2025Updated 8 months ago
- [CVPR 2023] DynaCam dataset - 3D human trajectories in global coordinates from videos captured by dynamic cameras☆80Jun 30, 2023Updated 2 years ago
- [CVPR 2025 Oral] PyTorch re-implementation for Autoregressive Distillation of Diffusion Transformers (ARD).☆142Oct 1, 2025Updated 5 months ago
- ☆116Jun 28, 2024Updated last year
- Official Implementation of ConsisLoRA☆62Mar 28, 2025Updated 11 months ago
- Globally Consistent Probabilistic Human Motion Estimation☆23Feb 28, 2023Updated 3 years ago