ClaudiaCuttano / SAMWISE
[CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
☆173 Updated last month
Alternatives and similar repositories for SAMWISE:
Users who are interested in SAMWISE are comparing it to the libraries listed below.
- [CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT). ☆131 Updated this week
- CAVIS: Context-Aware Video Instance Segmentation ☆86 Updated 3 weeks ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory ☆58 Updated 2 months ago
- Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark ☆122 Updated 3 weeks ago
- The official implementation of the paper "ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations". ☆38 Updated 3 months ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2" ☆281 Updated last month
- This repo aims to include materials (papers, codes, slides) about SAM2 (segment anything in images and videos). We are continuously impro… ☆70 Updated 3 weeks ago
- Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation ☆62 Updated last month
- Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆51 Updated 2 months ago
- ☆117 Updated 10 months ago
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation ☆88 Updated last month
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024 ☆38 Updated 5 months ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection ☆56 Updated last year
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper. ☆244 Updated 6 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation" ☆88 Updated 2 months ago
- Scaling Vision Pre-Training to 4K Resolution ☆155 Updated last week
- Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries". ☆56 Updated last year
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything ☆62 Updated last year
- Code release for "SegLLM: Multi-round Reasoning Segmentation" ☆91 Updated 2 months ago
- This repo is the official implementation of iSeg: An Iterative Refinement-based Framework for Training-free Segmentation. ☆36 Updated 5 months ago
- ☆68 Updated last month
- [Fully open] [Encoder-free MLLM] Vision as LoRA ☆146 Updated 3 weeks ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module) ☆121 Updated 3 weeks ago
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation. ☆42 Updated 4 months ago
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation ☆43 Updated 7 months ago
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025) ☆208 Updated last month
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation ☆34 Updated 2 months ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models" ☆76 Updated 2 months ago
- This repository is for the first survey on SAM & SAM2 for Videos. ☆47 Updated last week
- The official implementation of "Segment Anything with Multiple Modalities". ☆93 Updated 8 months ago