WeijieMax / EyeRealLinks
Offcial Code of EyeReal
☆98Updated 2 months ago
Alternatives and similar repositories for EyeReal
Users that are interested in EyeReal are comparing it to the libraries listed below
Sorting:
- Official implementation of "Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance"☆374Updated 2 weeks ago
- A list of works on video generation towards world model☆337Updated this week
- A paper list for spatial reasoning☆638Updated 3 weeks ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆167Updated 4 months ago
- Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Gener…☆327Updated 2 weeks ago
- [TCSVT 2025] Niagara: Normal-Integrated Geometric Affine Field for Scene Reconstruction from a Single View☆103Updated last month
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆203Updated 9 months ago
- 📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.☆412Updated this week
- Cambrian-S: Towards Spatial Supersensing in Video☆488Updated last month
- Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?☆216Updated last month
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue☆314Updated 7 months ago
- A Large-scale Video Action Dataset☆388Updated 3 weeks ago
- PyTorch implementation of NEPA☆308Updated 2 weeks ago
- [ICCV 2025 Oral] Official implementation of Learning Streaming Video Representation via Multitask Training.☆80Updated last month
- [CVPR 2025] EgoLife: Towards Egocentric Life Assistant☆390Updated 10 months ago
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆493Updated this week
- Orient Anything, ICML 2025☆372Updated 3 months ago
- Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models☆85Updated 3 weeks ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆259Updated this week
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography☆100Updated last month
- [NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration☆113Updated 2 months ago
- Pixio: a capable vision encoder dedicated to dense prediction, simply by pixel reconstruction☆345Updated 2 weeks ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆431Updated this week
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆207Updated 3 months ago
- Collection of Highlight papers☆42Updated last year
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆212Updated 2 months ago
- Official repo for UAE☆164Updated last month
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆60Updated 7 months ago
- Generative World Explorer☆165Updated 7 months ago
- [NeurIPS 2025] Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆90Updated 6 months ago