mbzuai-metaverse / XMem2
A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking
☆206Updated last year
Alternatives and similar repositories for XMem2
Users who are interested in XMem2 are comparing it to the repositories listed below
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3☆269Updated last year
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)☆212Updated last month
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi …☆326Updated last year
- [AAAI 2025] Elevating Flow-Guided Video Inpainting with Reference Generation☆88Updated 6 months ago
- A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024]☆383Updated last year
- [Information Fusion (Vol.103, Mar. '24)] Boosting Image Matting with Pretrained Plain Vision Transformers☆488Updated 4 months ago
- This repository contains code for deploying a Gradio application using the SAM2 model for video processing. The application allows users …☆44Updated last year
- [Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything Models☆567Updated last year
- [ICCV 2025, Highlight] ZIM: Zero-Shot Image Matting for Anything☆383Updated 3 months ago
- Official Code for Tracking Any Object Amodally☆120Updated last year
- Minimal code and examples for inferencing Sapiens foundation human models in Pytorch☆152Updated last year
- [NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception☆289Updated 3 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆59Updated 9 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆509Updated last year
- ☆105Updated last year
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think☆493Updated last year
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆236Updated 10 months ago
- Official PyTorch implementation for TCSVT 23 "Detect Any Shadow: Segment Anything for Video Shadow Detection"☆65Updated last year
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)☆192Updated 5 months ago
- Muggled SAM: Segmentation without the magic☆180Updated last week
- Mask-Free Video Instance Segmentation [CVPR 2023]☆368Updated last year
- ☆77Updated 9 months ago
- ☆63Updated 2 years ago
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆490Updated 9 months ago
- Official code for "AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation" (CVPR2023)☆260Updated 2 years ago
- The official implementation of ICCV'25 paper "FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution"☆348Updated 3 months ago
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆215Updated 11 months ago
- Dense Optical Tracking: Connecting the Dots☆314Updated last year
- ☆83Updated 6 months ago
- Dereflection Any Image with Diffusion Priors and Diversified Data [AAAI 2026]☆92Updated last month