IDEA-Research / TAPTRLinks
[ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3
☆263Updated 5 months ago
Alternatives and similar repositories for TAPTR
Users that are interested in TAPTR are comparing it to the libraries listed below
Sorting:
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi …☆307Updated 5 months ago
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think☆431Updated 5 months ago
- [CVPR 2025] Video Depth without Video Models☆538Updated 2 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆482Updated 6 months ago
- Orient Anything, ICML 2025☆276Updated 2 weeks ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆277Updated 3 months ago
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)☆209Updated 2 months ago
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024)☆173Updated last month
- Scaling Vision Pre-Training to 4K Resolution☆161Updated last month
- [CVPR 2025] Code for Segment Any Motion in Videos☆352Updated 2 months ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆248Updated last month
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆411Updated 2 months ago
- PIPs++☆306Updated 10 months ago
- [CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis☆337Updated 8 months ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere☆113Updated 10 months ago
- [CVPR'24] Group Anything with Radiance Fields☆427Updated 7 months ago
- Official Code for Tracking Any Object Amodally☆118Updated 10 months ago
- Muggled SAM: Segmentation without the magic☆139Updated last month
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)☆488Updated 6 months ago
- A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024]☆363Updated 6 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆311Updated 10 months ago
- [CVPR 2025] Official code for Using Diffusion Priors for Video Amodal Segmentation☆73Updated last month
- ZIM: Zero-Shot Image Matting for Anything☆284Updated 6 months ago
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆189Updated 4 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆253Updated 7 months ago
- Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction☆671Updated 2 months ago
- A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking☆198Updated last year
- Grounded Tracking for Streaming Videos☆104Updated 7 months ago
- Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset☆277Updated last week
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors☆233Updated 3 months ago