SkalskiP/top-cvpr-2025-papers



About: This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]

☆891

Alternatives and similar repositories for top-cvpr-2025-papers

Users who are interested in top-cvpr-2025-papers are comparing it to the repositories listed below.


SkalskiP / top-cvpr-2024-papers
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
☆734 · Apr 15, 2026 · Updated 3 months ago
SkalskiP / top-cvpr-2026-papers
This repository is a curated collection of the most exciting and influential CVPR 2026 papers. 🔥 [Paper + Code + Demo]
☆561 · Jun 6, 2026 · Updated last month
facebookresearch / EdgeTAM
[CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"
☆950 · Jan 27, 2026 · Updated 6 months ago
ClaudiaCuttano / SAMWISE
[CVPR 2025 Highlight] "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
☆386 · Sep 25, 2025 · Updated 10 months ago
roboflow / trackers
Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0…
☆3,558 · Updated this week
ClaudiaCuttano / SANSA
[NeurIPS 2025 Spotlight] "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."
☆203 · Dec 17, 2025 · Updated 7 months ago
roboflow / maestro
Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
☆2,689 · Updated this week
SkalskiP / vlms-zero-to-hero
This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.
☆1,179 · Jan 23, 2025 · Updated last year
facebookresearch / perception_models
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
☆2,330 · Apr 13, 2026 · Updated 3 months ago
roboflow / rf-detr
RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning…
☆8,783 · Updated this week
visinf / INSID3
[CVPR 2026 Oral] "INSID3: Training-Free In-Context Segmentation with DINOv3"
☆700 · Jun 26, 2026 · Updated last month
facebookresearch / dinov3
Reference PyTorch implementation and models for DINOv3
☆11,051 · Jul 15, 2026 · Updated 2 weeks ago
tstanczyk95 / McByte
[CVPRW 2025] McByte - tracking in sports without training (No Train Yet Gain)
☆108 · Jul 22, 2025 · Updated last year
roboflow / notebooks
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures l…
☆9,588 · Updated this week
facebookresearch / map-anything
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
☆3,602 · Jul 17, 2026 · Updated last week
lpiccinelli-eth / UniK3D
[CVPR 2025] UniK3D: Universal Camera Monocular 3D Estimation
☆744 · Sep 14, 2025 · Updated 10 months ago
Raessan / dinov3_deepstream
DeepStream integration of Meta’s DINOv3 backbone with lightweight heads for vision tasks.
☆26 · Feb 5, 2026 · Updated 5 months ago
facebookresearch / vggt
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
☆14,053 · May 19, 2026 · Updated 2 months ago
fkryan / gazelle
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆852 · Mar 18, 2026 · Updated 4 months ago
roboflow / sports
Computer vision and sports
☆5,272 · Jul 22, 2026 · Updated last week
jovanavidenovic / DAM4SAM
[CVPR 2025, IJCV 2026] "A Distractor-Aware Memory for Visual Object Tracking with SAM2", "Distractor-Aware Memory-Based Visual Object Tra…
☆490 · Apr 7, 2026 · Updated 3 months ago
NVlabs / FoundationStereo
[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching
☆2,840 · Dec 19, 2025 · Updated 7 months ago
suhwan-cho / FindTrack
[ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
☆82 · Oct 22, 2025 · Updated 9 months ago
ByteDance-Seed / Depth-Anything-3
Depth Anything 3
☆6,001 · Updated this week
Roboflow-Universe / finetune-RF-DETR
Modular CLI pipeline for fine‑tuning RF‑DETR object detection models on custom datasets.
☆35 · Dec 3, 2025 · Updated 7 months ago
siyuanliii / masa
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
☆1,375 · May 1, 2025 · Updated last year
CUT3R / CUT3R
Official implementation of Continuous 3D Perception Model with Persistent State
☆1,470 · Aug 27, 2025 · Updated 11 months ago
THU-MIG / yoloe
YOLOE: Real-Time Seeing Anything [ICCV 2025]
☆2,218 · Jun 26, 2025 · Updated last year
Davidyao99 / uni4d
[CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
☆225 · May 25, 2025 · Updated last year
henry123-boy / SpaTrackerV2
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
☆985 · Feb 27, 2026 · Updated 5 months ago
mega-sam / mega-sam
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
☆1,338 · Jan 5, 2026 · Updated 6 months ago
microsoft / MoGe
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
☆2,712 · Jul 21, 2026 · Updated last week
facebookresearch / sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆19,614 · May 30, 2026 · Updated 2 months ago
facebookresearch / vjepa2
PyTorch code and models for VJEPA2 self-supervised learning from video.
☆4,405 · Mar 23, 2026 · Updated 4 months ago
aharley / alltracker
AllTracker is a model for tracking all pixels in a video.
☆421 · May 8, 2026 · Updated 2 months ago
allenai / molmo
Code for the Molmo Vision-Language Model
☆921 · Dec 12, 2024 · Updated last year
roboflow / supervision
We write your reusable computer vision tools. 💜
☆48,460 · Updated this week
ruili3 / awesome-dust3r
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
☆802 · Nov 5, 2025 · Updated 8 months ago
Junyi42 / monst3r
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
☆1,383 · Jun 16, 2025 · Updated last year