ClaudiaCuttano / SAMWISELinks
[CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
☆307Updated last month
Alternatives and similar repositories for SAMWISE
Users that are interested in SAMWISE are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).☆325Updated 3 weeks ago
- Official repository for the paper "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."☆106Updated last month
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆151Updated 4 months ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆369Updated 2 months ago
- CAVIS: Context-Aware Video Instance Segmentation☆89Updated 3 weeks ago
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation☆69Updated 3 weeks ago
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆451Updated 5 months ago
- Scaling Vision Pre-Training to 4K Resolution☆198Updated 3 weeks ago
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆62Updated 2 weeks ago
- This repo aims to include materials (papers, codes, slides) about SAM2 (segment anything in images and videos). We are continuously impro…☆98Updated 2 weeks ago
- [NeurIPS 2024] Code release for "Segment Anything without Supervision"☆480Updated 2 months ago
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)☆358Updated 11 months ago
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)☆211Updated 4 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆58Updated 5 months ago
- Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning☆227Updated 3 weeks ago
- Muggled SAM: Segmentation without the magic☆154Updated this week
- [Fully open] [Encoder-free MLLM] Vision as LoRA☆333Updated 2 months ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆259Updated 4 months ago
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.☆271Updated 2 months ago
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.☆256Updated 10 months ago
- Efficient Track Anything☆623Updated 7 months ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆83Updated 2 months ago
- About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]☆762Updated 2 months ago
- ☆125Updated last year
- Official Code for Tracking Any Object Amodally☆118Updated last year
- [ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning☆290Updated 3 months ago
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation☆77Updated 2 months ago
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25)☆47Updated 8 months ago
- ☆103Updated 4 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆106Updated last month