ada-cheng / Image_FMAP
☆16Updated last year
Alternatives and similar repositories for Image_FMAP
Users that are interested in Image_FMAP are comparing it to the libraries listed below
Sorting:
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆91Updated last week
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆99Updated last month
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆86Updated last year
- ImageNet3D: Towards General-Purpose Object-Level 3D Understanding☆18Updated 5 months ago
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆48Updated 2 weeks ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆103Updated 6 months ago
- Code for PointInfinity: Resolution-Invariant Point Diffusion Models☆30Updated 11 months ago
- ☆32Updated last month
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆29Updated last week
- Open-world 3D part segmentation of point clouds☆78Updated 2 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", arXiv 2025.☆62Updated last month
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆35Updated last week
- Repository of the 3DV paper "Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes".☆25Updated 5 months ago
- Geometry-aware Novel View Synthesis with Pre-trained 2D Prior☆40Updated last year
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation☆44Updated 5 months ago
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆122Updated last month
- Unifying 2D and 3D Vision-Language Understanding☆82Updated last month
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆50Updated 10 months ago
- https://coshand.cs.columbia.edu/☆16Updated 6 months ago
- ☆35Updated last month
- ☆52Updated 3 weeks ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆72Updated this week
- [ArXiv 2025] WorldMem: Long-term Consistent World Simulation with Memory☆124Updated last week
- Program synthesis for 3D spatial reasoning☆31Updated 2 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆31Updated 3 months ago
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆101Updated last month
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆72Updated last week
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆37Updated 5 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆118Updated last year
- [CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields☆71Updated last year