alimohammadiamirhossein / smite
PyTorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)
☆202 Updated this week
Alternatives and similar repositories for smite:
Users who are interested in smite are comparing it to the repositories listed below.
- ☆230 Updated last month
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi … ☆292 Updated 2 months ago
- Depth Any Video with Scalable Synthetic Data ☆451 Updated 2 months ago
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3 ☆254 Updated 2 months ago
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024) ☆155 Updated 9 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning ☆262 Updated 3 months ago
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024) ☆156 Updated 3 months ago
- [WACV 2025] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think ☆388 Updated 2 months ago
- [CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis ☆333 Updated 5 months ago
- A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking ☆189 Updated 11 months ago
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models ☆152 Updated 3 weeks ago
- Prompt Depth Anything ☆549 Updated this week
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024. ☆46 Updated 3 months ago
- GenXD: Generating Any 3D and 4D Scenes ☆175 Updated 3 weeks ago
- Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024). ☆324 Updated 2 months ago
- Video Depth without Video Models ☆446 Updated 2 months ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors ☆216 Updated 2 months ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere ☆113 Updated 7 months ago
- Official PyTorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024) ☆464 Updated 2 months ago
- ☆226 Updated last week
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model" ☆360 Updated last month
- Muggled SAM: Segmentation without the magic ☆99 Updated 2 weeks ago
- Code for "Real3D: Scaling Up Large Reconstruction Models with Real-World Images" ☆163 Updated 8 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation ☆221 Updated 3 months ago
- Official Code for Tracking Any Object Amodally ☆116 Updated 7 months ago
- ZIM: Zero-Shot Image Matting for Anything ☆254 Updated 3 months ago
- [arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos ☆339 Updated last month
- Official PyTorch implementation of "XHand: Real-time Expressive Hand Avatar" ☆76 Updated 6 months ago
- ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024). ☆356 Updated last month