Chenyu-Wang567 / All-Angles-BenchLinks
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
☆56Updated 2 weeks ago
Alternatives and similar repositories for All-Angles-Bench
Users that are interested in All-Angles-Bench are comparing it to the libraries listed below
Sorting:
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆62Updated 4 months ago
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆59Updated 5 months ago
- [CVPR 2025] GPS as a Control Signal for Image Generation☆24Updated 8 months ago
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆66Updated 2 months ago
- Visual Spatial Tuning☆149Updated last week
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆126Updated last month
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆272Updated last month
- ☆102Updated last month
- Official implementation of "VIRAL: Visual Representation Alignment for MLLMs".☆136Updated 2 months ago
- Self-reimplemented version of 4D-LRM.☆63Updated 6 months ago
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆45Updated 6 months ago
- ☆18Updated last month
- ☆51Updated 3 months ago
- The official implementation of paper “VChain: Chain-of-Visual-Thought for Reasoning in Video Generation”☆106Updated 2 months ago
- A list of works on video generation towards world model☆230Updated 2 weeks ago
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆131Updated 3 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆170Updated this week
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆53Updated last month
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆24Updated 7 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆85Updated 9 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆47Updated 7 months ago
- [NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding☆128Updated last week
- ☆30Updated last year
- [NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration"☆201Updated this week
- ☆42Updated 3 months ago
- ☆63Updated last month
- [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆186Updated 6 months ago
- [Nips 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆122Updated 4 months ago
- Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling☆103Updated 2 weeks ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆98Updated 5 months ago