ada-cheng / Image_FMAP
☆16Updated last year
Alternatives and similar repositories for Image_FMAP
Users who are interested in Image_FMAP are comparing it to the repositories listed below.
- Code release for paper "Test-Time Training Done Right"☆321Updated last week
- ☆118Updated 3 months ago
- ☆150Updated 10 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆164Updated last week
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆53Updated last week
- Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling☆94Updated this week
- Code for PointInfinity: Resolution-Invariant Point Diffusion Models☆35Updated last year
- ☆167Updated last month
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆98Updated 8 months ago
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…☆240Updated 7 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆93Updated last year
- ☆52Updated 8 months ago
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".☆40Updated 5 months ago
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆137Updated 4 months ago
- [AAAI 2025] DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors☆220Updated last year
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆270Updated last month
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆271Updated last month
- (CVPR 2025 Highlight) The Scene Language: Representing Scenes with Programs, Words, and Embeddings☆247Updated 4 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆83Updated last year
- Open-world 3D part segmentation of point clouds☆103Updated 4 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆209Updated this week
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆132Updated 7 months ago
- ☆17Updated last month
- Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆160Updated last month
- [NeurIPS 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆122Updated 4 months ago
- A list of works on video generation towards world model☆222Updated this week
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆78Updated 5 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆337Updated this week
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆104Updated 8 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆22Updated last year