MCG-NJU / SAM2-PlusLinks
SAM 2++: Tracking Anything at Any Granularity
☆38Updated 2 weeks ago
Alternatives and similar repositories for SAM2-Plus
Users that are interested in SAM2-Plus are comparing it to the libraries listed below
Sorting:
- ECCV 2024 STMA & CVPR 2024 1st MOSE & 1st VOT Challenge & 1st LSVOS v6☆11Updated last year
- ☆24Updated 7 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆97Updated 7 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆24Updated 5 months ago
- [AAAI 2025] Video Diffusion Models are Strong Video Inpainter☆16Updated 4 months ago
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆28Updated 5 months ago
- Official implementation of DepthLM☆260Updated last month
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆23Updated 7 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆34Updated 5 months ago
- ☆26Updated 7 months ago
- [NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution☆200Updated last month
- Seeing World Dynamics in a Nutshell☆110Updated 8 months ago
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆55Updated 8 months ago
- ☆113Updated 5 months ago
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025)☆59Updated 2 weeks ago
- [NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…☆139Updated last month
- open-sourced video dataset with dynamic scenes and camera movements annotation☆78Updated 6 months ago
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆119Updated last week
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video☆55Updated 6 months ago
- ☆35Updated 6 months ago
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆25Updated 3 weeks ago
- VideoDirector [CVPR 2025]☆32Updated 7 months ago
- [CVPR 2025] Open-World Amodal Appearance Completion☆41Updated last week
- This is the official implementation of work HiM2SAM in PRCV25.☆19Updated 2 months ago
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆73Updated 2 months ago
- Official implementation of our paper "Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images"☆71Updated 5 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆56Updated 3 months ago
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆58Updated 4 months ago
- ☆27Updated 7 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆42Updated 3 months ago