yejun688 / CVPR2025_oral_paper_listLinks
😎 A curated list of CVPR 2025 Oral paper. Total 96
☆59Updated last month
Alternatives and similar repositories for CVPR2025_oral_paper_list
Users that are interested in CVPR2025_oral_paper_list are comparing it to the libraries listed below
Sorting:
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆326Updated 3 months ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆305Updated last year
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆420Updated this week
- [NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆218Updated 3 weeks ago
- A paper list for spatial reasoning☆588Updated 2 weeks ago
- 📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.☆375Updated this week
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆102Updated 6 months ago
- [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆189Updated 7 months ago
- Vision Manus: Your versatile Visual AI assistant☆305Updated 2 months ago
- ☆70Updated 9 months ago
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆273Updated this week
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆189Updated last month
- [NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆253Updated 3 months ago
- Official code for the CVPR 2025 paper "Navigation World Models".☆502Updated last month
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆202Updated 8 months ago
- 😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.☆258Updated 2 weeks ago
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆24Updated 11 months ago
- We used a web scraper to obtain all the papers from ECCV that have not yet been officially announced, making them available for those who…☆24Updated last year
- ☆16Updated 7 months ago
- [NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding☆137Updated last month
- Thinking in 360°: Humanoid Visual Search in the Wild☆105Updated last month
- Official repo and evaluation implementation of VSI-Bench☆658Updated 5 months ago
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"☆582Updated 5 months ago
- Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)☆98Updated 7 months ago
- Awesome Spatial Intelligence (Personal Use)☆45Updated this week
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆361Updated 2 months ago
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models☆157Updated last year
- Embodied Question Answering (EQA) benchmark and method (ICCV 2025)☆44Updated 4 months ago
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy☆330Updated last week
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆120Updated 2 months ago