yejun688 / CVPR2025_oral_paper_list
A curated list of CVPR 2025 Oral papers. Total: 96
★24 · Updated this week
Alternatives and similar repositories for CVPR2025_oral_paper_list
Users interested in CVPR2025_oral_paper_list are comparing it to the repositories listed below
- [CVPR'25] Official implementation of paper "MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders". ★25 · Updated 2 months ago
- The repo of paper "RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation". ★122 · Updated 5 months ago
- Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning. ★154 · Updated 2 weeks ago
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models ★133 · Updated 8 months ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ★85 · Updated 9 months ago
- ★157 · Updated last month
- Official repository for VisionZip (CVPR 2025) ★285 · Updated last week
- ★14 · Updated last month
- Embodied Question Answering (EQA) benchmark and method ★20 · Updated 2 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models." ★265 · Updated last week
- OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation ★169 · Updated last week
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation ★97 · Updated 2 months ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models" ★198 · Updated 5 months ago
- AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segm… ★83 · Updated 5 months ago
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model ★222 · Updated last month
- The official implementation of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning" ★143 · Updated this week
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks ★120 · Updated this week
- [ACMMM 2024] Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors ★23 · Updated 7 months ago
- NeurIPS 2024 ★34 · Updated last month
- [ICLR2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs ★53 · Updated 3 months ago
- Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning. ★96 · Updated last week
- [ICML 2024] A Touch, Vision, and Language Dataset for Multimodal Alignment ★78 · Updated this week
- Official repo of VLABench, a large-scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. ★227 · Updated this week
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution" ★92 · Updated 3 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration" ★72 · Updated 8 months ago
- [CVPR 2025] The code for the paper "Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding". ★104 · Updated last month
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation (CVPR 2024) ★132 · Updated 10 months ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into a Single Large Language Model for Video Segmenta… ★38 · Updated last month
- [CVPR2025] FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression ★41 · Updated 3 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities ★74 · Updated 7 months ago