sajjad-sh33 / ViT-P
The Missing Point in Vision Transformers for Universal Image Segmentation
☆55 Updated this week
Alternatives and similar repositories for ViT-P
Users who are interested in ViT-P are comparing it to the repositories listed below.
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆90 Updated 4 months ago
- ☆33 Updated last month
- This repo aims to include materials (papers, codes, slides) about SAM2 (segment anything in images and videos). We are continuously impro… ☆120 Updated last month
- ☆129 Updated last year
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything ☆261 Updated 7 months ago
- ☆69 Updated last year
- ☆93 Updated last year
- Open-Vocabulary Panoptic Segmentation ☆27 Updated 5 months ago
- Multi-Class Few-Shot Semantic Segmentation with Visual Prompts ☆66 Updated 3 weeks ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything ☆69 Updated last year
- [AAAI 2026] Code for "SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation". ☆144 Updated this week
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception ☆142 Updated 5 months ago
- [CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT). ☆476 Updated 2 weeks ago
- [CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt" ☆162 Updated last year
- DVIS: Decoupled Video Instance Segmentation Framework ☆154 Updated last year
- [ICCV 2025] Official implementation of the paper: "Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Obj… ☆69 Updated 3 months ago
- One summary of efficient segment anything models ☆109 Updated last year
- ☆20 Updated last year
- SSA + FastSAM/Semantic Fast Segment Anything, or Fast Semantic Segment Anything ☆111 Updated 5 months ago
- The official implementation of "Segment Anything with Multiple Modalities". ☆106 Updated last year
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024 ☆87 Updated last year
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models ☆131 Updated last year
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory ☆58 Updated 8 months ago
- Official code for CAVIS: Context-Aware Video Instance Segmentation ☆91 Updated last month
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model". ☆175 Updated 11 months ago
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes ☆90 Updated 6 months ago
- [ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabular… ☆136 Updated this week
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement ☆73 Updated 6 months ago
- This repository is for the first survey on SAM & SAM2 for Videos. ☆51 Updated 6 months ago
- Testing adaptation of the DINOv2/3 encoders for vision tasks with Low-Rank Adaptation (LoRA) ☆344 Updated 3 weeks ago