yuanpengtu / VideoAnydoorLinks
[SIGGRAGH'25] Official repository of VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control
☆21Updated 2 months ago
Alternatives and similar repositories for VideoAnydoor
Users that are interested in VideoAnydoor are comparing it to the libraries listed below
Sorting:
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction' (ICCV 2025)☆74Updated last month
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025)☆56Updated last month
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆36Updated last year
- [ICASSP2025] ConcealGS: Conceal Implicit Information in 3D Gaussian Splatting☆17Updated 7 months ago
- [Arxiv 25'] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆41Updated 3 weeks ago
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆30Updated last year
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆26Updated 11 months ago
- Self-reimplemented version of 4D-LRM.☆52Updated 3 months ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆29Updated last year
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆95Updated 3 months ago
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆70Updated 9 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆94Updated 5 months ago
- ☆22Updated 2 months ago
- [CVPR 2025] Open-World Amodal Appearance Completion☆35Updated 2 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆31Updated 3 months ago
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆21Updated 3 months ago
- Official Release of ICCV 2025 paper -- DiscretizedSDF☆90Updated 3 weeks ago
- [CVPR 2025 Oral] FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video☆45Updated last week
- The official repository for paper "MLLMs Need 3D-Aware Representation Supervision for Scene Understanding"☆97Updated last month
- ☆78Updated 3 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆50Updated last month
- ☆17Updated 5 months ago
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.☆99Updated last year
- Open-world 3D part segmentation of point clouds☆84Updated last month
- ☆75Updated 3 months ago
- [AAAI 2025] More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding☆23Updated 3 months ago
- [ICML2025 Oral] ReferSplat: Referring Segmentation in 3D Gaussian Splatting☆100Updated this week
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆48Updated last month
- [CVPR 2024] DreamComposer: Controllable 3D Object Generation via Multi-View Conditions☆135Updated last year
- VideoDirector [CVPR 2025]☆28Updated 5 months ago