About
This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]
★ 856 | Jun 16, 2025 | Updated 9 months ago
Alternatives and similar repositories for top-cvpr-2025-papers
Users who are interested in top-cvpr-2025-papers are comparing it to the repositories listed below.
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo] | ★ 738 | Jun 2, 2025 | Updated 9 months ago
- [CVPRW 2025] McByte - tracking in sports without training (No Train Yet Gain) | ★ 99 | Jul 22, 2025 | Updated 8 months ago
- This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models. | ★ 1,159 | Jan 23, 2025 | Updated last year
- ★ 64 | Feb 27, 2026 | Updated last month
- Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0… | ★ 3,109 | Mar 23, 2026 | Updated last week
- Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight) | ★ 829 | Mar 18, 2026 | Updated last week
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL | ★ 2,660 | Mar 23, 2026 | Updated last week
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering | ★ 46 | Oct 15, 2025 | Updated 5 months ago
- [MICCAI 2025] FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation. | ★ 23 | Sep 24, 2025 | Updated 6 months ago
- Scaling Vision Pre-Training to 4K Resolution | ★ 222 | Jan 4, 2026 | Updated 2 months ago
- A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model. | ★ 786 | Nov 5, 2025 | Updated 4 months ago
- ★ 101 | Mar 31, 2025 | Updated 11 months ago
- MambaGlue: Fast and Robust Local Feature Matching With Mamba @ ICRA'25 | ★ 272 | Mar 13, 2025 | Updated last year
- MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion (CVPR 2025) | ★ 494 | Jul 9, 2025 | Updated 8 months ago
- [CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation" | ★ 371 | Sep 25, 2025 | Updated 6 months ago
- Code for "Multi-view Reconstruction via SfM-guided Monocular Depth Estimation". CVPR 2025 (Oral Presentation) | ★ 363 | Aug 6, 2025 | Updated 7 months ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere | ★ 112 | Jul 10, 2024 | Updated last year
- Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion" | ★ 1,345 | Jun 16, 2025 | Updated 9 months ago
- Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos" | ★ 1,260 | Jan 5, 2026 | Updated 2 months ago
- Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model" | ★ 419 | Nov 24, 2025 | Updated 4 months ago
- [ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for… | ★ 6,012 | Updated this week
- SOTA Spherical Target-based Calibration (accepted in IROS'25) | ★ 29 | Updated this week
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model" | ★ 901 | Jan 27, 2026 | Updated 2 months ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything | ★ 1,365 | May 1, 2025 | Updated 10 months ago
- Cameras as Relative Positional Encoding | ★ 695 | Dec 18, 2025 | Updated 3 months ago
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures l… | ★ 9,278 | Updated this week
- This is a project on visual spatial reasoning tasks-SIBench | ★ 26 | Jan 12, 2026 | Updated 2 months ago
- [ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes. | ★ 517 | Apr 1, 2025 | Updated 11 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability. | ★ 98 | Dec 17, 2024 | Updated last year
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision | ★ 2,377 | Nov 2, 2025 | Updated 4 months ago
- The collection of medical VLP papers | ★ 20 | Jul 24, 2024 | Updated last year
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More! | ★ 2,212 | Mar 12, 2026 | Updated 2 weeks ago
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning | ★ 19 | Oct 6, 2025 | Updated 5 months ago
- OcelStream is a modular DeepStream-based platform for real-time video analytics using custom models like YOLO, SAM, and D-Fine. It includ… | ★ 37 | Nov 20, 2025 | Updated 4 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… | ★ 18,791 | Mar 20, 2026 | Updated last week
- A curated list of awesome DUST3R/MAST3R related papers. | ★ 35 | Aug 5, 2025 | Updated 7 months ago
- SIM4D | ★ 30 | Mar 27, 2025 | Updated last year
- Gaga: Group Any Gaussians via 3D-aware Memory Bank | ★ 402 | Aug 4, 2025 | Updated 7 months ago
- [CVPR 2025] MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | ★ 328 | Dec 7, 2025 | Updated 3 months ago