Chenyu-Wang567 / All-Angles-Bench
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
☆47 Updated last month
Alternatives and similar repositories for All-Angles-Bench
Users who are interested in All-Angles-Bench are comparing it to the repositories listed below
- Self-reimplemented version of 4D-LRM. ☆50 Updated 2 months ago
- [CVPR 2025] GPS as a Control Signal for Image Generation ☆21 Updated 5 months ago
- The official repository for paper "MLLMs Need 3D-Aware Representation Supervision for Scene Understanding" ☆88 Updated last week
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding ☆55 Updated last month
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance ☆41 Updated 2 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation ☆127 Updated last month
- From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D ☆54 Updated 3 months ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering" ☆50 Updated 5 months ago
- ☆75 Updated 2 months ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness ☆50 Updated last month
- A list of works on video generation towards world model ☆162 Updated 2 weeks ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors' ☆107 Updated 3 weeks ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti… ☆78 Updated last year
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation" ☆63 Updated last month
- Official Implementation of Paper Transfer between Modalities with MetaQueries ☆219 Updated last month
- [AAAI 2025] More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding ☆22 Updated 3 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding ☆43 Updated last week
- The official repository of "Sekai: A Video Dataset towards World Exploration" ☆137 Updated last month
- [CVPR 2025🔥] Official codebase for "Global-Local Tree Search in VLMs for 3D Indoor Scene Generation" ☆17 Updated 4 months ago
- The official implementation of the paper "Exploring the Potential of Encoder-free Architectures in 3D LMMs" ☆55 Updated 3 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes ☆98 Updated 5 months ago
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction' (ICCV 2025) ☆73 Updated last month
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model ☆95 Updated 2 months ago
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion. ☆98 Updated last year
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control ☆76 Updated last month
- [CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs. ☆190 Updated last year
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models ☆110 Updated 8 months ago
- Official implementation of "URECA : Unique Region Caption Anything" ☆55 Updated last month
- [CVPR 2025] Open-World Amodal Appearance Completion ☆32 Updated last month
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction ☆247 Updated 3 weeks ago