About: This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]
★866 · Apr 15, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for top-cvpr-2025-papers
Users that are interested in top-cvpr-2025-papers are comparing it to the libraries listed below.
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo] ★735 · Apr 15, 2026 · Updated 3 weeks ago
- This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models. ★1,166 · Jan 23, 2025 · Updated last year
- ★73 · Feb 27, 2026 · Updated 2 months ago
- Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0… ★3,377 · Updated this week
- Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight) ★836 · Mar 18, 2026 · Updated last month
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ★2,671 · May 1, 2026 · Updated last week
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering ★46 · Oct 15, 2025 · Updated 6 months ago
- [MICCAI 2025] FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation. ★23 · Sep 24, 2025 · Updated 7 months ago
- Scaling Vision Pre-Training to 4K Resolution ★227 · Jan 4, 2026 · Updated 4 months ago
- A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model. ★790 · Nov 5, 2025 · Updated 6 months ago
- ★104 · Mar 31, 2025 · Updated last year
- MambaGlue: Fast and Robust Local Feature Matching With Mamba @ ICRA'25 ★274 · Updated this week
- MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion (CVPR 2025) ★504 · Jul 9, 2025 · Updated 10 months ago
- [CVPR 2025 Highlight] "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation" ★375 · Sep 25, 2025 · Updated 7 months ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere ★112 · Jul 10, 2024 · Updated last year
- Code for "Multi-view Reconstruction via SfM-guided Monocular Depth Estimation". CVPR 2025 (Oral Presentation) ★367 · Aug 6, 2025 · Updated 9 months ago
- Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion" ★1,357 · Jun 16, 2025 · Updated 10 months ago
- Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos" ★1,289 · Jan 5, 2026 · Updated 4 months ago
- Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model" ★430 · Nov 24, 2025 · Updated 5 months ago
- SOTA Spherical Target-based Calibration (accepted in IROS'25) ★29 · Mar 27, 2026 · Updated last month
- [ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for… ★6,974 · Updated this week
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything ★1,375 · May 1, 2025 · Updated last year
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model" ★919 · Jan 27, 2026 · Updated 3 months ago
- Cameras as Relative Positional Encoding ★711 · Dec 18, 2025 · Updated 4 months ago
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures l… ★9,364 · Mar 27, 2026 · Updated last month
- [ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes. ★523 · Apr 1, 2025 · Updated last year
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability. ★98 · Dec 17, 2024 · Updated last year
- This is a project on visual spatial reasoning tasks-SIBench ★26 · Jan 12, 2026 · Updated 3 months ago
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision ★2,437 · Nov 2, 2025 · Updated 6 months ago
- The collection of medical VLP papers ★20 · Jul 24, 2024 · Updated last year
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More! ★2,264 · Apr 13, 2026 · Updated 3 weeks ago
- A curated list of awesome DUST3R/MAST3R related papers. ★35 · Aug 5, 2025 · Updated 9 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ★19,095 · Apr 7, 2026 · Updated last month
- SIM4D ★30 · Mar 27, 2025 · Updated last year
- [TMLR 2026] Gaga: Group Any Gaussians via 3D-aware Memory Bank ★406 · May 2, 2026 · Updated last week
- [CVPR 2025] MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM ★333 · Dec 7, 2025 · Updated 5 months ago
- [CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences ★594 · Dec 2, 2024 · Updated last year
- Tennis Detection and Visualization System: An advanced computer vision system for tennis match analysis that tracks players and ball move… ★34 · Updated this week
- Collaborative Dynamic 3D Scene Graphs for Open-Vocabulary Urban Scene Understanding ★35 · Dec 23, 2025 · Updated 4 months ago