SuleBai/SC-CLIP

[TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation

☆72

Alternatives and similar repositories for SC-CLIP

Users that are interested in SC-CLIP are comparing it to the libraries listed below.

AndyTang15 / FLAG3Dv2
☆25 · May 9, 2024 · Updated 2 years ago
AndyTang15 / FLAG3D
☆19 · Jun 22, 2026 · Updated 3 weeks ago
shiyi-zh0408 / NAE_CVPR2024
[CVPR 2024] Narrative Action Evaluation with Prompt-Guided Multimodal Interaction
☆43 · May 16, 2024 · Updated 2 years ago
yongliu20 / SCAN
[CVPR 2024] Official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"
☆77 · Sep 23, 2024 · Updated last year
zhang9302002 / ThinkingWithVideos
The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"
☆101 · Oct 15, 2025 · Updated 9 months ago
VoyageWang / IteRPrimE
The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…
☆20 · Apr 6, 2025 · Updated last year
InvincibleWyq / ChatVID
Chat about anything on any video!
☆39 · Sep 5, 2023 · Updated 2 years ago
Yxxxb / LAVT-RS
[CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation
☆26 · Jan 21, 2025 · Updated last year
leaves162 / CLIPtrase
cliptrase
☆47 · Sep 1, 2024 · Updated last year
zdk258 / CorrCLIP
[ICCV 2025 Oral] CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation
☆70 · Aug 1, 2025 · Updated 11 months ago
dahyun-kang / lavg
[ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
☆51 · Sep 24, 2024 · Updated last year
yongliu20 / Awesome-Unified-Understanding-and-Generation
☆52 · Aug 22, 2025 · Updated 10 months ago
AMAP-ML / UniVG-R1
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning
☆165 · Jun 2, 2025 · Updated last year
Tengbo-Yu / AnyBimanual
[ICCV2025] AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation
☆103 · Jun 26, 2025 · Updated last year
Yxxxb / VoCo-LLaMA
[CVPR'2025] Official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".
☆205 · Jun 18, 2025 · Updated last year
EternalEvan / DPMesh
The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery", CVPR 2024
☆45 · Jun 4, 2024 · Updated 2 years ago
MICV-yonsei / CASS
[CVPR 2025] Official Pytorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
☆50 · Mar 27, 2025 · Updated last year
jiaosiyu1999 / MAFT-Plus
☆60 · Sep 14, 2024 · Updated last year
hustvl / MaskAdapter
[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"
☆135 · Oct 23, 2025 · Updated 8 months ago
fhgyuanshen / HybridGL
[CVPR 2025] Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation
☆37 · Jun 27, 2025 · Updated last year
sinahmr / LocAtViT
PyTorch Implementation of LocAtViT in "Locality-Attending Vision Transformer" (ICLR 2026)
☆18 · Mar 10, 2026 · Updated 4 months ago
shiyi-zh0408 / LOGO
[CVPR 2023] LOGO: A Long-Form Video Dataset for Group Action Quality Assessment
☆48 · Apr 9, 2024 · Updated 2 years ago
linsun449 / cliper.code
Official PyTorch implementation of the paper: CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-V…
☆41 · Sep 10, 2025 · Updated 10 months ago
zhu-xlab / GlobalBuildingMap
☆16 · Dec 15, 2025 · Updated 7 months ago
mc-lan / ProxyCLIP
[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
☆120 · Mar 26, 2025 · Updated last year
yliu1229 / AlignSeg
The PyTorch implementation of AlignSeg.
☆21 · Feb 26, 2025 · Updated last year
IVGSZ / Flash-VStream
Official implementation of ICCV 2025 "Flash-VStream: Efficient Real-Time Understanding for Long Video Streams"
☆285 · Oct 15, 2025 · Updated 9 months ago
shiyi-zh0408 / Meta-CoT
[CVPR 2026] Official code of the paper "Meta-CoT: Enhancing Granularity and Generalization in Image Editing"
☆78 · May 6, 2026 · Updated 2 months ago
RammusLeo / DPMesh
The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery"
☆25 · Jul 25, 2024 · Updated last year
Jixuan-Fan / Momentum-GS
[ICCV 2025] Code for Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction
☆173 · Dec 15, 2025 · Updated 7 months ago
jdg900 / MMR
[ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm…
☆28 · Apr 3, 2025 · Updated last year
YuHengsss / Trident
[ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation
☆125 · Nov 22, 2025 · Updated 7 months ago
VoyageWang / VG-Refiner
The repository of the VG-Refiner paper
☆20 · Dec 9, 2025 · Updated 7 months ago
kaist-cvml / part-clipseg
[NeurIPS 2024] Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
☆62 · Dec 29, 2024 · Updated last year
mbzuai-oryx / Video-R2
Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models
☆19 · Jan 21, 2026 · Updated 6 months ago
WalBouss / GEM
[CVPR24] Official Implementation of GEM (Grounding Everything Module)
☆139 · Apr 10, 2025 · Updated last year
DaehanKim-Korea / VisDA2022_1st_Place_Solution
☆11 · Jun 3, 2023 · Updated 3 years ago
m-arda-aydn / ITACLIP
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements [CVPRW 2025]
☆24 · Jan 31, 2026 · Updated 5 months ago
rongfu-dsb / MPG-SAM2
[ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
☆23 · Sep 5, 2025 · Updated 10 months ago