SAM 2++: Tracking Anything at Any Granularity
☆67Dec 15, 2025Updated 6 months ago
Alternatives and similar repositories for SAM2-Plus
Users that are interested in SAM2-Plus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repository for "Finding NeMO: A Geometry-Aware Representation of Template Views for Few-Shot Perception"☆31Apr 28, 2026Updated 2 months ago
- [ICCV'25] Official PyTorch Implementation of "JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers"☆31Nov 27, 2025Updated 7 months ago
- ☆22Mar 7, 2025Updated last year
- [CVPR 2026 Highlight] WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning☆92Jun 18, 2026Updated 2 weeks ago
- Video Depth Propagation [3DV 2026]☆38Jan 23, 2026Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.☆30Oct 19, 2025Updated 8 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆37Nov 27, 2024Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 4 months ago
- Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"☆64Mar 23, 2026Updated 3 months ago
- ☆218Jun 15, 2026Updated 2 weeks ago
- Self-training LLaVA for medical☆16Nov 3, 2024Updated last year
- [ICLR 2025] 🏄 OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer☆94Aug 4, 2025Updated 11 months ago
- ☆55Mar 17, 2025Updated last year
- ☆12Dec 29, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the paper "IFFNeRF: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model"☆12May 26, 2024Updated 2 years ago
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)☆39Apr 17, 2025Updated last year
- [ICCV 2023] Robust Object Modeling for Visual Tracking, Official Implementation☆47Jan 5, 2025Updated last year
- Offical implementation of work 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation☆19Feb 5, 2025Updated last year
- Large-Vocabulary Video Instance Segmentation dataset☆97Jul 5, 2024Updated 2 years ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (ICML 2026)☆43Jun 26, 2026Updated last week
- non-rigid registration in NIMBLE: A Non-rigid Hand Model with Bones and Muscles☆11Sep 2, 2022Updated 3 years ago
- [CVPR25] SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs☆19Aug 27, 2025Updated 10 months ago
- This library implements functions and classes for mesh registration, data augmentation, and data normalisation.☆12Oct 7, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of the model architecture for SRT-H☆28Jun 20, 2026Updated 2 weeks ago
- Official Code for CVPR2025 Paper: LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion☆33May 4, 2026Updated 2 months ago
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆18Feb 3, 2025Updated last year
- [ICCV 2025] CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image☆22May 20, 2026Updated last month
- 📚 2025 Scene Graph ArXiv Paper List — Updated Daily☆16Mar 18, 2026Updated 3 months ago
- Algorithms for face super resolution implemented in Pytorch.☆13Feb 9, 2021Updated 5 years ago
- [ECCV 2024] Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation☆36Jan 6, 2025Updated last year
- [ECCV 2022] Tackling Background Distraction in Video Object Segmentation☆40Jun 2, 2025Updated last year
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆70Jun 23, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views (ICCV2023)☆14Oct 9, 2023Updated 2 years ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 6 months ago
- CroCoDL (CVPR 2025) fork containing the additions on top of LaMAR (ECCV 2022)☆29Apr 8, 2026Updated 2 months ago
- [AAAI 2025] Video Diffusion Models are Strong Video Inpainter☆16Jul 21, 2025Updated 11 months ago
- N-dimensional Rotary Position Embeddings for PyTorch☆85Feb 14, 2024Updated 2 years ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆58Dec 28, 2025Updated 6 months ago
- Official implementation for the CVPR 2024 paper: HuProSO3: Normalizing Flows on the Product Space of SO(3) Manifolds for Probabilistic Hu…☆15Mar 31, 2025Updated last year