Chenyu-Wang567 / All-Angles-BenchLinks
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
☆45Updated 2 weeks ago
Alternatives and similar repositories for All-Angles-Bench
Users that are interested in All-Angles-Bench are comparing it to the libraries listed below
Sorting:
- Self-reimplemented version of 4D-LRM.☆48Updated 2 months ago
- [CVPR 2025] GPS as a Control Signal for Image Generation☆20Updated 4 months ago
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆40Updated 2 months ago
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography☆67Updated this week
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆122Updated last week
- ☆72Updated 2 months ago
- From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D☆52Updated 2 months ago
- Official implementation of "Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness".☆47Updated 2 weeks ago
- The official repository for paper "MLLMs Need 3D-Aware Representation Supervision for Scene Understanding"☆72Updated last month
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆94Updated this week
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆79Updated 11 months ago
- [CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.☆187Updated last year
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆186Updated 3 weeks ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆95Updated 4 months ago
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆54Updated 3 weeks ago
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training☆41Updated last year
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models☆110Updated 8 months ago
- [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆140Updated 2 months ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆45Updated 4 months ago
- A list of works on video generation towards world model☆161Updated this week
- [CVPR 2025] Open-World Amodal Appearance Completion☆31Updated 3 weeks ago
- [CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation☆111Updated last month
- Official implementation of "URECA : Unique Region Caption Anything"☆55Updated 3 weeks ago
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆97Updated last month
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆52Updated last month
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆311Updated last month
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆140Updated 2 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆314Updated last year
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.☆98Updated last year
- LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS☆91Updated 3 weeks ago