Omni Controllable Video Diffusion
☆41Dec 22, 2025Updated 2 months ago
Alternatives and similar repositories for OmniVDiff
Users that are interested in OmniVDiff are comparing it to the libraries listed below
Sorting:
- MetricSolver☆20Apr 17, 2025Updated 10 months ago
- [CVPR 2026] ViStoryBench: AI Story Visualization Benchmark☆137Updated this week
- ☆17Apr 17, 2025Updated 10 months ago
- [CVPR 2026] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding☆76Feb 22, 2026Updated last week
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆30Oct 28, 2025Updated 4 months ago
- ☆15Jan 1, 2025Updated last year
- UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions☆47Dec 16, 2025Updated 2 months ago
- (I3D 2025) Normal-guided Detail-Preserving Neural Implicit Function for High-Fidelity 3D Surface Reconstruction [Proceedings of the ACM i…☆19May 26, 2025Updated 9 months ago
- ☆27Mar 3, 2025Updated last year
- Standardized DataLoaders for 3D Computer Vision☆26Mar 28, 2025Updated 11 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 7 months ago
- ☆32Feb 7, 2026Updated 3 weeks ago
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory☆417Jul 25, 2025Updated 7 months ago
- Official Pytorch implementation of SHACIRA: Scalable HAsh-grid Compression for Implicit Neural Representations☆29Nov 30, 2023Updated 2 years ago
- Code and data for UniEgoMotion (ICCV 2025)☆44Nov 11, 2025Updated 3 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆39Jun 9, 2025Updated 8 months ago
- ☆13Nov 21, 2025Updated 3 months ago
- RNb-NeuS2: Multi-View Surface Reconstruction Using Normal and Reflectance Cues☆50Dec 3, 2025Updated 3 months ago
- This project is the official implementation of "UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Gener…☆208Jan 29, 2026Updated last month
- HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.☆134Feb 20, 2026Updated 2 weeks ago
- AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models☆139Jan 6, 2026Updated 2 months ago
- Official Implementation of paper accepted by ICLR2025-MoDGS: Dynamic Gaussian Splatting from Casually-captured Monocular Videos with Dept…☆164Jun 24, 2025Updated 8 months ago
- [ICLR 2026] StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams☆84Feb 17, 2026Updated 2 weeks ago
- Code for the paper "IFFNeRF: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model"☆12May 26, 2024Updated last year
- [ICCV 2025] "Player-Centric Multimodal Prompt Generation for Large Language Model Based Identity-Aware Basketball Video Captioning".☆17Dec 11, 2025Updated 2 months ago
- ☆62Jul 1, 2025Updated 8 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆44Aug 9, 2025Updated 6 months ago
- ☆10Oct 5, 2022Updated 3 years ago
- MovieLabs Ontology for Media Creation (OMC)☆21Feb 7, 2026Updated last month
- [CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO☆96Updated this week
- 📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!☆126Feb 21, 2026Updated last week
- Quick Long Video Understanding [TMLR2025]☆76Oct 27, 2025Updated 4 months ago
- ☆26Dec 2, 2025Updated 3 months ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆20Mar 21, 2025Updated 11 months ago
- ☆12Sep 19, 2021Updated 4 years ago
- [SPM2018] Sparse3D: A new global model for matching sparse RGB-D dataset with small inter-frame overlap☆12Aug 13, 2018Updated 7 years ago
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation☆23Dec 30, 2025Updated 2 months ago
- [ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks☆30Feb 5, 2026Updated last month
- Implementation of the Paper Scene-Graph ViT☆10Dec 20, 2024Updated last year