ada-cheng / Image_FMAPLinks
☆16Updated last year
Alternatives and similar repositories for Image_FMAP
Users that are interested in Image_FMAP are comparing it to the libraries listed below
Sorting:
- Code release for paper "Test-Time Training Done Right"☆181Updated last week
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆118Updated 2 weeks ago
- [ArXiv 2025] WorldMem: Long-term Consistent World Simulation with Memory☆177Updated last month
- ☆131Updated 6 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆95Updated 4 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆43Updated 2 months ago
- Code for PointInfinity: Resolution-Invariant Point Diffusion Models☆31Updated last year
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆135Updated last month
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆78Updated 4 months ago
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…☆182Updated 2 months ago
- [CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields☆75Updated last year
- Seeing World Dynamics in a Nutshell☆109Updated 4 months ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆81Updated 2 weeks ago
- ☆39Updated 3 months ago
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆115Updated 3 months ago
- Repository of the 3DV paper "Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes".☆26Updated 7 months ago
- Open-world 3D part segmentation of point clouds☆83Updated 2 weeks ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆116Updated 2 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆109Updated 8 months ago
- Cameras as Relative Positional Encoding☆265Updated this week
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆135Updated 3 months ago
- SceneFun3D ToolKit☆147Updated 3 months ago
- ☆71Updated last month
- Unifying 2D and 3D Vision-Language Understanding☆95Updated 3 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆137Updated last week
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆149Updated this week
- Self-reimplemented version of 4D-LRM.☆47Updated last month
- (CVPR 2025 Highlight) The Scene Language: Representing Scenes with Programs, Words, and Embeddings☆224Updated last week
- [AAAI 2025] DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors☆207Updated last year
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆90Updated last year