MaureenZOU / m3-spatialLinks
[ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory
☆162Updated last month
Alternatives and similar repositories for m3-spatial
Users that are interested in m3-spatial are comparing it to the libraries listed below
Sorting:
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆185Updated last month
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆120Updated last week
- Feature splatting based on INRIA GS rasterizer☆78Updated 2 months ago
- SceneFun3D ToolKit☆136Updated last month
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting☆165Updated 2 months ago
- [ICLR2025] A PyTorch implementation for STORM: Spatiotemporal Reconstruction Model for Large-Scale Outdoor Scenes☆143Updated 2 weeks ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆103Updated last month
- PhyRecon: Physically Plausible Neural Scene Reconstruction☆156Updated 2 months ago
- ☆134Updated 6 months ago
- DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction☆122Updated 3 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆115Updated this week
- Generative World Explorer☆143Updated 6 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆101Updated 2 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆105Updated last month
- This is the official repository for "EgoLifter Open-world 3D Segmentation for Egocentric Perception, ECCV 2024"☆122Updated 7 months ago
- Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data (CVPR 2025)☆182Updated 2 weeks ago
- ☆100Updated 10 months ago
- Pytorch Code for "LEGaussians: Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding"☆140Updated 6 months ago
- Fillerbuster: Multi-View Scene Completion for Casual Captures☆101Updated 3 months ago
- [ArXiv 2025] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction☆285Updated last month
- Stereo4D dataset and processing code☆222Updated last month
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆105Updated last month
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆93Updated 4 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆277Updated 3 months ago
- ☆69Updated last year
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos☆227Updated 8 months ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆84Updated last week
- Self-reimplemented version of Long-LRM.☆160Updated last month
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆123Updated 2 months ago
- This is the official release for the paper "EFM3D A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models" (https//arx…☆144Updated 2 months ago