IDEA-Research / TAPTR
[ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3
☆262 Updated 4 months ago
Alternatives and similar repositories for TAPTR:
Users who are interested in TAPTR are comparing it to the repositories listed below:
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi … ☆304 Updated 4 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025) ☆474 Updated 5 months ago
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024) ☆166 Updated 2 weeks ago
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025) ☆208 Updated last month
- ☆265 Updated 3 weeks ago
- [CVPR 2025] Video Depth without Video Models ☆505 Updated last month
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning ☆274 Updated 2 months ago
- PIPs++ ☆305 Updated 9 months ago
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think ☆422 Updated 4 months ago
- Muggled SAM: Segmentation without the magic ☆131 Updated 3 weeks ago
- A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024] ☆359 Updated 5 months ago
- [CVPR 2025] Code for Segment Any Motion in Videos ☆319 Updated last month
- [CVPR'24] Group Anything with Radiance Fields ☆414 Updated 6 months ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything ☆243 Updated 3 weeks ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere ☆113 Updated 9 months ago
- Official Code for Tracking Any Object Amodally ☆117 Updated 9 months ago
- Grounded Tracking for Streaming Videos ☆101 Updated 6 months ago
- Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction ☆666 Updated last month
- Dense Optical Tracking: Connecting the Dots ☆284 Updated 5 months ago
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024) ☆484 Updated 5 months ago
- Scaling Vision Pre-Training to 4K Resolution ☆154 Updated last week
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models ☆184 Updated 3 months ago
- This is the official release for the paper "EFM3D A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models" (https//arx… ☆143 Updated last month
- ☆78 Updated 3 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation ☆248 Updated 6 months ago
- [CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis ☆335 Updated 7 months ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2" ☆281 Updated 3 weeks ago
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024) ☆165 Updated last year
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors ☆230 Updated 2 months ago
- Minimal code and examples for inferencing Sapiens foundation human models in Pytorch ☆138 Updated 6 months ago