MaureenZOU / m3-spatialLinks
[ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory
☆164Updated last month
Alternatives and similar repositories for m3-spatial
Users that are interested in m3-spatial are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆153Updated last month
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆192Updated 2 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆167Updated 2 weeks ago
- SceneFun3D ToolKit☆142Updated 2 months ago
- [ICLR2025] A PyTorch implementation for STORM: Spatiotemporal Reconstruction Model for Large-Scale Outdoor Scenes☆183Updated last month
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting☆177Updated 3 months ago
- Feature splatting based on INRIA GS rasterizer☆80Updated 3 months ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆107Updated last month
- ☆135Updated 6 months ago
- Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data (CVPR 2025)☆185Updated last month
- PhyRecon: Physically Plausible Neural Scene Reconstruction☆158Updated 3 months ago
- Fillerbuster: Multi-View Scene Completion for Casual Captures☆103Updated 4 months ago
- Stereo4D dataset and processing code☆243Updated 2 months ago
- DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction☆123Updated 3 months ago
- [ICCV2023] 🧊FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models☆127Updated 10 months ago
- This is the official repository for "EgoLifter Open-world 3D Segmentation for Egocentric Perception, ECCV 2024"☆123Updated 7 months ago
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆110Updated 2 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆93Updated 4 months ago
- ☆100Updated 11 months ago
- ☆192Updated last month
- [ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"☆314Updated last month
- [ CVPR 2025 ] We introduce LT3SD, a novel latent 3D scene diffusion approach enabling high-fidelity generation of infinite 3D environment…☆158Updated 3 weeks ago
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)☆112Updated 2 months ago
- A simple training-free approach adapting DUSt3R for dynamic scenes.☆371Updated 2 months ago
- [ECCV'24] FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher Information☆155Updated last month
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆123Updated last month
- This is the official release for the paper "EFM3D A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models" (https//arx…☆150Updated 3 months ago
- ☆63Updated 6 months ago
- MUSt3R: Multi-view Network for Stereo 3D Reconstruction☆150Updated this week
- Repository for running the VGGT model in PyTorch☆151Updated 2 months ago