mbzuai-metaverse / XMem2
A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking
☆189 Updated 11 months ago
Alternatives and similar repositories for XMem2:
Users who are interested in XMem2 are comparing it to the repositories listed below:
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi … ☆292 Updated 2 months ago
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025) ☆202 Updated this week
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3 ☆254 Updated 2 months ago
- [Information Fusion (Vol.103, Mar. '24)] Boosting Image Matting with Pretrained Plain Vision Transformers ☆391 Updated 8 months ago
- Depth Any Video with Scalable Synthetic Data ☆451 Updated 2 months ago
- Muggled SAM: Segmentation without the magic ☆99 Updated 2 weeks ago
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024) ☆156 Updated 3 months ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere ☆113 Updated 7 months ago
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024) ☆155 Updated 9 months ago
- ☆88 Updated 7 months ago
- ☆104 Updated 7 months ago
- [Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything Models ☆514 Updated 8 months ago
- Official Code for Tracking Any Object Amodally ☆116 Updated 7 months ago
- [AAAI 2025] Elevating Flow-Guided Video Inpainting with Reference Generation ☆46 Updated 2 months ago
- ☆58 Updated last year
- PIPs++ ☆303 Updated 7 months ago
- [CVPR24] MaGGIe: Mask Guided Gradual Human Instance Matting ☆57 Updated last month
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024. ☆46 Updated 3 months ago
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model" ☆360 Updated last month
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors ☆216 Updated 2 months ago
- ZIM: Zero-Shot Image Matting for Anything ☆254 Updated 3 months ago
- [WACV 2025] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think ☆388 Updated 2 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory ☆52 Updated 3 weeks ago
- Minimal code and examples for running inference with Sapiens foundation human models in PyTorch ☆127 Updated 3 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning ☆262 Updated 3 months ago
- Deficiency-Aware Masked Transformer for Video Inpainting ☆53 Updated last year
- Dense Optical Tracking: Connecting the Dots ☆270 Updated 3 months ago
- A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024] ☆344 Updated 2 months ago
- ☆230 Updated last month
- ☆226 Updated last week