yuanpengtu / VideoAnydoorLinks
[SIGGRAGH'25] Official repository of VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control
☆27Updated last week
Alternatives and similar repositories for VideoAnydoor
Users that are interested in VideoAnydoor are comparing it to the libraries listed below
Sorting:
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆30Updated last year
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction' (ICCV 2025)☆75Updated last month
- Self-reimplemented version of 4D-LRM.☆63Updated 6 months ago
- [CVPR 2025 Oral] FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video☆53Updated last month
- [CVPR 2024] DreamComposer: Controllable 3D Object Generation via Multi-View Conditions☆134Updated last year
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025)☆59Updated last month
- VideoDirector [CVPR 2025]☆33Updated 2 weeks ago
- [ICASSP2025] ConcealGS: Conceal Implicit Information in 3D Gaussian Splatting☆18Updated 10 months ago
- ☆83Updated 6 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆56Updated 3 months ago
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆41Updated last month
- [ECCV2024] ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation☆53Updated 8 months ago
- Official Repo for the Paper Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control☆37Updated 11 months ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆80Updated last year
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆27Updated last year
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆100Updated 8 months ago
- open-sourced video dataset with dynamic scenes and camera movements annotation☆79Updated 7 months ago
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆214Updated 10 months ago
- This project is the official implementation of "UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Gener…☆41Updated this week
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆78Updated last year
- Seeing World Dynamics in a Nutshell☆111Updated 8 months ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆30Updated last year
- Open-world 3D part segmentation of point clouds☆104Updated 4 months ago
- ☆78Updated last year
- ☆258Updated last month
- Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking☆56Updated 11 months ago
- ☆30Updated last year
- [CVPR 2025] Open-World Amodal Appearance Completion☆43Updated last month
- Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes".☆82Updated last month
- [NeurIPS 2025] ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models☆24Updated 5 months ago