mbzuai-metaverse / XMem2
A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking
☆191Updated last year
Alternatives and similar repositories for XMem2:
Users that are interested in XMem2 are comparing it to the libraries listed below
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi …☆299Updated 3 months ago
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3☆260Updated 3 months ago
- [Information Fusion (Vol.103, Mar. '24)] Boosting Image Matting with Pretrained Plain Vision Transformers☆401Updated 10 months ago
- This repository contains code for deploying a Gradio application using the SAM2 model for video processing. The application allows users …☆39Updated 6 months ago
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)☆206Updated last month
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆459Updated 3 months ago
- ☆204Updated last month
- ☆91Updated 8 months ago
- A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024]☆352Updated 3 months ago
- [AAAI 2025] Elevating Flow-Guided Video Inpainting with Reference Generation☆61Updated last week
- ☆61Updated last year
- Muggled SAM: Segmentation without the magic☆115Updated last month
- Official PyTorch implementation for TCSVT 23 "Detect Any Shadow: Segment Anything for Video Shadow Detection"☆57Updated 3 months ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere☆113Updated 8 months ago
- Coherent Video Inpainting Using Optical Flow-Guided Efficient Diffusion☆39Updated this week
- [Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything Models☆522Updated 9 months ago
- [CVPR24] MaGGIe: Mask Guided Gradual Human Instance Matting☆59Updated 2 months ago
- (CVPR 2024) Official code for paper "Towards Language-Driven Video Inpainting via Multimodal Large Language Models"☆94Updated 11 months ago
- Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch☆133Updated 5 months ago
- [CVPR 2025] Official code for Using Diffusion Priors for Video Amodal Segmentation☆64Updated 2 weeks ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆54Updated 3 weeks ago
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024)☆162Updated 4 months ago
- ZIM: Zero-Shot Image Matting for Anything☆266Updated 4 months ago
- ☆111Updated 8 months ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆243Updated last week
- Official Pytorch Implementation for "VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with …☆118Updated last year
- ☆66Updated 8 months ago
- ☆104Updated last year
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors☆219Updated 3 weeks ago
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)☆160Updated 10 months ago