[ICCV 2025] VLM4D: Towards Spatiotemporal Awareness in Vision Language Models
☆46Nov 20, 2025Updated 5 months ago
Alternatives and similar repositories for VLM4D
Users that are interested in VLM4D are comparing it to the libraries listed below.
- ☆43Apr 9, 2026Updated 3 weeks ago
- Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image☆67Dec 23, 2025Updated 4 months ago
- [NeurIPS 2025] PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation☆125Feb 22, 2026Updated 2 months ago
- [CVPR 2025] Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields☆40Oct 18, 2025Updated 6 months ago
- Orienting Latent Actions for Video World Modeling☆88Apr 20, 2026Updated 2 weeks ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆27Apr 9, 2025Updated last year
- [NeurIPS 2024] MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting☆153May 25, 2025Updated 11 months ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆29Jun 16, 2025Updated 10 months ago
- [NeurIPS'2025] Official repository for "LiveStar: Live Streaming Assistant for Real-World Online Video Understanding"☆115Nov 25, 2025Updated 5 months ago
- Code for RA-L paper "Multi-Robot Object SLAM using Distributed Variational Inference"☆25May 9, 2024Updated last year
- The official implementation for "Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos".☆50May 23, 2025Updated 11 months ago
- real-to-sim evaluation suite for robot parkour☆11Jan 19, 2025Updated last year
- [IEEE VR'22] SPAA: Stealthy Projector-based Adversarial Attacks on Deep Image Classifiers☆12Jun 21, 2025Updated 10 months ago
- [CVPR 2026] Dexterous World Models☆92Apr 6, 2026Updated last month
- Official Code Repository for the CoRL 2024 Paper: "Toward General Object-Level Mapping from Sparse Views with 3D Diffusion Priors"☆32Jan 7, 2025Updated last year
- [CVPR2026] Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning☆45Mar 27, 2026Updated last month
- ☆54Feb 9, 2026Updated 2 months ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆50Mar 20, 2025Updated last year
- Depth-Guided Dynamic Neural Radiance Field using RGB-D data☆16Apr 4, 2023Updated 3 years ago
- ☆44Jan 4, 2026Updated 4 months ago
- Official code for paper: "RayRoPE: Projective Ray Positional Encoding for Multi-view Attention"☆129Mar 27, 2026Updated last month
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆84Jul 6, 2025Updated 10 months ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆27Apr 14, 2025Updated last year
- Python implementation of HilbertSort for sorting 3D point clouds using space-filling curves☆11Apr 17, 2019Updated 7 years ago
- [ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew …☆13Oct 23, 2024Updated last year
- SIGGRAPH 2025 Journal track☆26Aug 13, 2025Updated 8 months ago
- ☆49Sep 19, 2024Updated last year
- [WACV 2025] Official code of "SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark"☆23Sep 3, 2025Updated 8 months ago
- TrOCR but 2 to 3 times faster☆11Oct 22, 2022Updated 3 years ago
- ☆15Jun 6, 2024Updated last year
- Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS…☆14Dec 9, 2021Updated 4 years ago
- Repository for SIGIR'18 paper: "Ranking for Relevance and Display Preferences in Complex Presentation Layouts"☆16Aug 28, 2018Updated 7 years ago
- [ECCV 2024] DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting☆125Feb 1, 2025Updated last year
- [ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency☆241Oct 29, 2025Updated 6 months ago
- A simple state update rule to enhance length generalization for CUT3R☆649Oct 1, 2025Updated 7 months ago
- Motion-aware 3D Gaussian Splatting for Efficient Dynamic Scene Reconstruction☆25Jul 29, 2024Updated last year
- [ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors☆442Oct 2, 2025Updated 7 months ago
- [arXiv'25]🌈 Unseen 3D Geometry Reasoning from a Single Image.☆82Jul 10, 2025Updated 9 months ago
- Retargeting of whole-body human motion to humanoid robots for dexterous manipulation of articulated objects.☆30Jan 28, 2026Updated 3 months ago