SAM 2++: Tracking Anything at Any Granularity
☆64Dec 15, 2025Updated 5 months ago
Alternatives and similar repositories for SAM2-Plus
Users that are interested in SAM2-Plus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repository for "Finding NeMO: A Geometry-Aware Representation of Template Views for Few-Shot Perception"☆28Apr 28, 2026Updated 3 weeks ago
- [ICCV'25] Official PyTorch Implementation of "JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers"☆29Nov 27, 2025Updated 5 months ago
- ☆14May 25, 2024Updated 2 years ago
- ☆22Mar 7, 2025Updated last year
- GHUStereo models are novel real-time stereo matching architectures with a low computation complexity characterized by compact cost volum…☆31Dec 14, 2025Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR 2026] UniCorrn: Unified Correspondence Transformer Across 2D and 3D☆172May 6, 2026Updated 2 weeks ago
- [CVPR 2026 Highlight] WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning☆80Mar 25, 2026Updated 2 months ago
- Official code of the paper "VideoMolmo: Spatio-Temporal Grounding meets Pointing"☆55Jul 5, 2025Updated 10 months ago
- Video Depth Propagation [3DV 2026]☆36Jan 23, 2026Updated 4 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆37Nov 27, 2024Updated last year
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Apr 25, 2026Updated last month
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 2 months ago
- Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"☆63Mar 23, 2026Updated 2 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆31Mar 13, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆13May 26, 2017Updated 8 years ago
- Self-training LLaVA for medical☆16Nov 3, 2024Updated last year
- [ICLR 2025] 🏄 OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer☆92Aug 4, 2025Updated 9 months ago
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT☆32Feb 6, 2026Updated 3 months ago
- ☆11Dec 29, 2021Updated 4 years ago
- Code for the paper "IFFNeRF: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model"☆12May 26, 2024Updated last year
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆36Aug 28, 2025Updated 8 months ago
- Offical implementation of work 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation☆19Feb 5, 2025Updated last year
- Large-Vocabulary Video Instance Segmentation dataset☆98Jul 5, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆12Oct 31, 2024Updated last year
- non-rigid registration in NIMBLE: A Non-rigid Hand Model with Bones and Muscles☆11Sep 2, 2022Updated 3 years ago
- [3DV 2026] GIGA: Generalizable Sparse Image-driven Gaussian Humans☆17Jan 28, 2026Updated 3 months ago
- This library implements functions and classes for mesh registration, data augmentation, and data normalisation.☆12Oct 7, 2024Updated last year
- Expanded Adaptive Scaling Normalization for End to End Image Compression☆10Sep 4, 2025Updated 8 months ago
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆17Feb 3, 2025Updated last year
- [ICCV 2025] CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image☆22Updated this week
- Algorithms for face super resolution implemented in Pytorch.☆13Feb 9, 2021Updated 5 years ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views (ICCV2023)☆14Oct 9, 2023Updated 2 years ago
- CroCoDL (CVPR 2025) fork containing the additions on top of LaMAR (ECCV 2022)☆30Apr 8, 2026Updated last month
- [AAAI 2025] Video Diffusion Models are Strong Video Inpainter☆16Jul 21, 2025Updated 10 months ago
- N-dimensional Rotary Position Embeddings for PyTorch☆84Feb 14, 2024Updated 2 years ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆56Dec 28, 2025Updated 4 months ago
- [CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆130Apr 7, 2026Updated last month
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆47Sep 28, 2024Updated last year