mbzuai-metaverse / XMem2
A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking
☆179Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for XMem2
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi …☆272Updated 2 weeks ago
- Pytorch Implementation of "SMITE: Segment Me In TimE"☆119Updated 2 weeks ago
- [ECCV 2024] Official implementation of the paper "TAPTR: Tracking Any Point with Transformers as Detection"☆199Updated 2 months ago
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024)☆135Updated last week
- TrailBlazer: Trajectory Control for Diffusion-Based Video Generation☆91Updated 5 months ago
- [Information Fusion (Vol.103, Mar. '24)] Boosting Image Matting with Pretrained Plain Vision Transformers☆370Updated 5 months ago
- [Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything Models☆485Updated 5 months ago
- Code release for Image Sculpting: Precise Object Editing with 3D Geometry Control [CVPR 2024]☆272Updated 8 months ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere☆111Updated 4 months ago
- Official Implementation of LOTUS: Diffusion-based Visual Foundation Model for High-quality Dense Prediction☆454Updated last week
- [ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation☆425Updated 4 months ago
- Depth Any Video with Scalable Synthetic Data☆390Updated 2 weeks ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors☆192Updated 4 months ago
- Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think. Accepted to WACV 2025 and NeurIPS AFM Workshop.☆326Updated last week
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆499Updated 2 months ago
- ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).☆330Updated 3 months ago
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆235Updated 10 months ago
- MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild (CVPR2024 Oral)☆206Updated 4 months ago
- This respository contains the code for the CVPR 2024 paper AVID: Any-Length Video Inpainting with Diffusion Model.☆145Updated 8 months ago
- Muggled SAM: Segmentation without the magic☆54Updated this week
- Official Pytorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)☆195Updated last week
- Official repo for VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads.☆147Updated last month
- Official Code for Tracking Any Object Amodally☆113Updated 4 months ago
- Dense Optical Tracking: Connecting the Dots☆252Updated 7 months ago
- ☆93Updated 4 months ago
- [IEEE TVCG 2024] Customized Video Generation Using Textual and Structural Guidance☆184Updated 8 months ago
- 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting☆179Updated 5 months ago
- Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting☆158Updated 2 months ago
- ☆215Updated 2 months ago
- Official code for "AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation" (CVPR2023)☆218Updated last year