WalBouss/GEM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WalBouss/GEM)

WalBouss / GEM

[CVPR24] Official Implementation of GEM (Grounding Everything Module)

☆139

Alternatives and similar repositories for GEM

Users that are interested in GEM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WalBouss / MaskInversion
View on GitHub
[ICLR 26] Official Implementation of MaskInversion
☆32Feb 28, 2026Updated 4 months ago
wysoczanska / clip_dinoiser
View on GitHub
Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.
☆285Oct 26, 2024Updated last year
sinahmr / NACLIP
View on GitHub
PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation"
☆78Sep 23, 2024Updated last year
mc-lan / ProxyCLIP
View on GitHub
[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
☆120Mar 26, 2025Updated last year
leaves162 / CLIPtrase
View on GitHub
cliptrase
☆47Sep 1, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zbf1991 / WeCLIP
View on GitHub
CVPR2024
☆110Mar 12, 2025Updated last year
SuleBai / SC-CLIP
View on GitHub
[TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
☆72Mar 27, 2026Updated 3 months ago
aimagelab / freeda
View on GitHub
FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)
☆50Aug 28, 2024Updated last year
slonetime / EBSeg
View on GitHub
[CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
☆41Jan 12, 2026Updated 6 months ago
paulgavrikov / visualoverload
View on GitHub
VisualOverload (CVPR 2026) is a VQA benchmark for image understanding in dense, high-resolution scenes.
☆18May 31, 2026Updated last month
wangf3014 / SCLIP
View on GitHub
Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
☆192Updated this week
danielchyeh / this-is-my
View on GitHub
Official This-Is-My Dataset published in CVPR 2023
☆16Jul 18, 2024Updated 2 years ago
Vibashan / PosSAM
View on GitHub
Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything
☆71Apr 7, 2024Updated 2 years ago
sinahmr / LocAtViT
View on GitHub
PyTorch Implementation of LocAtViT in "Locality-Attending Vision Transformer" (ICLR 2026)
☆18Mar 10, 2026Updated 4 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
WalBouss / SenFormer
View on GitHub
This is the official repo of SenFormer [BMVC'22]
☆72Dec 3, 2023Updated 2 years ago
mlpc-ucsd / MasQCLIP
View on GitHub
(ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation
☆37Oct 18, 2023Updated 2 years ago
jiaosiyu1999 / MAFT-Plus
View on GitHub
☆60Sep 14, 2024Updated last year
xmed-lab / CLIP_Surgery
View on GitHub
[Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
☆479Mar 1, 2025Updated last year
NVlabs / PerVLBenchmark
View on GitHub
☆11Jul 31, 2022Updated 3 years ago
SMILE-data / SMILE
View on GitHub
SMILE: A Multimodal Dataset for Understanding Laughter
☆13Jun 15, 2023Updated 3 years ago
linyq2117 / TagCLIP
View on GitHub
[AAAI 2024] TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training
☆115Jan 9, 2024Updated 2 years ago
yongliu20 / SCAN
View on GitHub
[CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"
☆77Sep 23, 2024Updated last year
mlpc-ucsd / MaskCLIP
View on GitHub
Code Release for MaskCLIP (ICML 2023)
☆78Nov 29, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WalBouss / LeGrad
View on GitHub
[ICCV25] Official Implementation of LeGrad
☆99Oct 14, 2024Updated last year
wysoczanska / clip-diy
View on GitHub
Official implementation of the WACV 2024 paper CLIP-DIY
☆34Dec 20, 2023Updated 2 years ago
dahyun-kang / lavg
View on GitHub
[ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
☆51Sep 24, 2024Updated last year
ninatu / howtocaption
View on GitHub
Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024
☆58Aug 19, 2025Updated 11 months ago
google / storybench
View on GitHub
☆55Oct 16, 2023Updated 2 years ago
m-arda-aydn / ITACLIP
View on GitHub
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements [CVPRW 2025]
☆24Jan 31, 2026Updated 5 months ago
yxchng / mask-grounding
View on GitHub
[CVPR2024] Mask Grounding for Referring Image Segmentation
☆29Jul 22, 2024Updated 2 years ago
ilkerkesen / ViLMA
View on GitHub
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)
☆16Jan 18, 2024Updated 2 years ago
HVision-NKU / Cascade-CLIP
View on GitHub
Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
☆58Aug 15, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bytedance / fc-clip
View on GitHub
[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…
☆345Feb 5, 2024Updated 2 years ago
Xujxyang / OpenTrans
View on GitHub
☆26Apr 17, 2024Updated 2 years ago
ATR-DBI / Cross3DVG
View on GitHub
☆12May 5, 2024Updated 2 years ago
nikosips / UDON
View on GitHub
☆11Nov 18, 2024Updated last year
chongzhou96 / MaskCLIP
View on GitHub
Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral)
☆480Sep 19, 2022Updated 3 years ago
berkeley-hipie / HIPIE
View on GitHub
[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"
☆294Jun 19, 2025Updated last year
cvlab-kaist / CAT-Seg
View on GitHub
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
☆385Apr 11, 2024Updated 2 years ago