[ICCV 2025] VLM4D: Towards Spatiotemporal Awareness in Vision Language Models
☆49Nov 20, 2025Updated 6 months ago
Alternatives and similar repositories for VLM4D
Users that are interested in VLM4D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of D4RT, Efficiently Reconstructing Dynamic Scenes, from Deepmind☆70Jun 8, 2026Updated last week
- ☆46Apr 9, 2026Updated 2 months ago
- Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image☆68May 8, 2026Updated last month
- [CVPR 2025] Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields☆40Oct 18, 2025Updated 8 months ago
- [NeurIPS 2025] PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation☆127Feb 22, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR'26] UniGame code implementation☆20Apr 21, 2026Updated last month
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆27Apr 9, 2025Updated last year
- [NeurIPS 2024] MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting☆154May 25, 2025Updated last year
- [ICML 2026] Orienting Latent Actions for Video World Modeling☆108Apr 20, 2026Updated last month
- [NeurIPS'2025] Official repository for "LiveStar: Live Streaming Assistant for Real-World Online Video Understanding"☆119Nov 25, 2025Updated 6 months ago
- Code for RA-L paper "Multi-Robot Object SLAM using Distributed Variational Inference"☆25May 9, 2024Updated 2 years ago
- The official implementation for "Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos".☆50May 23, 2025Updated last year
- real-to-sim evaluation suite for robot parkour☆11Jan 19, 2025Updated last year
- [CVPR 2026] Dexterous World Models☆110May 23, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official Code Repository for the CoRL 2024 Paper: "Toward General Object-Level Mapping from Sparse Views with 3D Diffusion Priors"☆32Jan 7, 2025Updated last year
- ☆55Feb 9, 2026Updated 4 months ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆50Mar 20, 2025Updated last year
- Official code for paper: "RayRoPE: Projective Ray Positional Encoding for Multi-view Attention"☆134Mar 27, 2026Updated 2 months ago
- [CVPR2026] Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning☆54Mar 27, 2026Updated 2 months ago
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆84Jul 6, 2025Updated 11 months ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆28Apr 14, 2025Updated last year
- Python implementation of HlibertSort for sorting 3D point clouds using space-filling curves☆11Apr 17, 2019Updated 7 years ago
- [ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew …☆13Oct 23, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆45Jan 4, 2026Updated 5 months ago
- Siggraph 2025 Journal track☆27Aug 13, 2025Updated 10 months ago
- [WACV 2025] Official code of "SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark"☆24Sep 3, 2025Updated 9 months ago
- ☆15May 21, 2026Updated 3 weeks ago
- ☆51Sep 19, 2024Updated last year
- [ECCV 2024] DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting☆127May 13, 2026Updated last month
- [ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency☆242Oct 29, 2025Updated 7 months ago
- Motion-aware 3D Gaussian Splatting for Efficient Dynamic Scene Reconstruction☆25Jul 29, 2024Updated last year
- [ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors☆449Oct 2, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR 2026] A simple state update rule to enhance length generalization for CUT3R☆674May 11, 2026Updated last month
- Retargeting of whole-body human motion to humanoid robots for dexterous manipulation of articulated objects.☆32Jan 28, 2026Updated 4 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆38Dec 30, 2025Updated 5 months ago
- ☆11May 9, 2022Updated 4 years ago
- [ECCV'2024] MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views☆66Aug 4, 2025Updated 10 months ago
- 利用Unity复刻的跳一跳小游戏☆10Apr 7, 2021Updated 5 years ago
- [ICML 2026] 🎨 Occluded 3D Scene Reconstruction from a Single Image.☆85Jun 9, 2026Updated last week