mbzuai-metaverse / XMem2
A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking
☆180Updated 8 months ago
Related projects
Alternatives and complementary repositories for XMem2
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi …☆273Updated 3 weeks ago
- [ECCV 2024] Official implementation of the paper "TAPTR: Tracking Any Point with Transformers as Detection"☆200Updated 3 months ago
- Pytorch Implementation of "SMITE: Segment Me In TimE"☆161Updated 3 weeks ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors☆193Updated 4 months ago
- TrailBlazer: Trajectory Control for Diffusion-Based Video Generation☆91Updated 5 months ago
- [Information Fusion (Vol.103, Mar. '24)] Boosting Image Matting with Pretrained Plain Vision Transformers☆370Updated 5 months ago
- Code release for Image Sculpting: Precise Object Editing with 3D Geometry Control [CVPR 2024]☆272Updated 8 months ago
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024)☆136Updated 2 weeks ago
- Depth Any Video with Scalable Synthetic Data☆398Updated 3 weeks ago
- Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think. Accepted to WACV 2025 and NeurIPS AFM Workshop.☆334Updated this week
- MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild (CVPR2024 Oral)☆210Updated 5 months ago
- Official PyTorch implementation of DiffTF (Accepted by ICLR2024)☆189Updated 4 months ago
- Official repo for the paper "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"☆63Updated this week
- Official Implementation of LOTUS: Diffusion-based Visual Foundation Model for High-quality Dense Prediction☆486Updated last week
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)☆138Updated 6 months ago
- Repo for the papers "Intrinsic Image Decomposition via Ordinal Shading" (TOG 2023) and "Colorful Diffuse Intrinsic Image Decomposition in…☆172Updated 3 weeks ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere☆113Updated 4 months ago
- A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024]☆320Updated 3 weeks ago
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆515Updated 3 months ago
- Official Pytorch Implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer"☆151Updated 11 months ago
- [ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation☆432Updated 4 months ago
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024.☆43Updated 2 weeks ago
- Synthesizing Moving People with 3D Control☆119Updated 10 months ago
- Official Pytorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)☆197Updated 3 weeks ago
- Muggled SAM: Segmentation without the magic☆58Updated last week
- Minimal code and examples for running inference with Sapiens foundation human models in Pytorch☆101Updated 3 weeks ago
- Official implementation of L-MAGIC☆124Updated 3 months ago