[ECCV 2024 & NeurIPS 2024 & ICLR 2026] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3
☆279Feb 10, 2026Updated 2 months ago
Alternatives and similar repositories for TAPTR
Users that are interested in TAPTR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space☆1,042Aug 8, 2025Updated 8 months ago
- Dense Optical Tracking: Connecting the Dots☆321Nov 19, 2024Updated last year
- CoTracker is a model for tracking any point (pixel) on a video.☆4,925Mar 3, 2026Updated last month
- Tracking Any Point (TAP)☆1,866Mar 30, 2026Updated last month
- [CVPR 2025] RollingDepth: Video Depth without Video Models☆607Mar 18, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024)☆212Apr 16, 2025Updated last year
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆145Apr 6, 2025Updated last year
- ☆1,257Aug 2, 2025Updated 9 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆514Dec 4, 2024Updated last year
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere☆112Jul 10, 2024Updated last year
- [ICLR 25, TPAMI 26, CVPR 26] Track-On: Online Point Tracking with Memory☆151Mar 13, 2026Updated last month
- MFT: Long-Term Tracking of Every Pixel -- code for the WACV 2024 paper☆68Nov 15, 2024Updated last year
- Efficient Track Anything☆798Jan 6, 2025Updated last year
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆182Oct 15, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [3DV 2025] Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, …☆976Mar 26, 2025Updated last year
- Code release for CVPR'24 submission 'OmniGlue'☆713Aug 12, 2024Updated last year
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)☆555Nov 23, 2024Updated last year
- High-resolution models for human tasks.☆5,350Nov 18, 2024Updated last year
- Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"☆1,355Jun 16, 2025Updated 10 months ago
- ☆2,264Jun 11, 2024Updated last year
- [CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences☆593Dec 2, 2024Updated last year
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,374May 1, 2025Updated last year
- VGGSfM: Visual Geometry Grounded Deep Structure From Motion☆1,378Mar 11, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors☆442Oct 2, 2025Updated 7 months ago
- [Arxiv 2024] MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms☆15Dec 1, 2024Updated last year
- [CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos☆1,543Nov 30, 2025Updated 5 months ago
- [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos☆1,888Oct 7, 2025Updated 6 months ago
- Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024☆1,636Jun 28, 2024Updated last year
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆1,104Jan 21, 2025Updated last year
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆36Jun 10, 2025Updated 10 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,369Jul 23, 2025Updated 9 months ago
- TAPIP3D: Tracking Any Point in Persistent 3D Geometry☆393Dec 28, 2025Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision☆2,432Nov 2, 2025Updated 6 months ago
- ☆87Jan 14, 2025Updated last year
- [DEPRECATED] GLOMAP - Global Structured-from-Motion Revisited☆2,308Jan 30, 2026Updated 3 months ago
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆799Aug 16, 2024Updated last year
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation☆3,129Dec 10, 2025Updated 4 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆24May 5, 2024Updated last year
- [ECCV 2024 Oral] Code for RPBG: Towards Robust Neural Point-based Graphics in the Wild.☆44Aug 22, 2024Updated last year