Qinying-Liu/TagAlign

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Qinying-Liu/TagAlign)

Qinying-Liu / TagAlign

Official implementation of TagAlign

☆35

Alternatives and similar repositories for TagAlign

Users that are interested in TagAlign are comparing it to the libraries listed below

Sorting:

muyangyi / SimSeg
View on GitHub
[CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation
☆59Jan 26, 2025Updated last year
Jazzcharles / OVSegmentor
View on GitHub
OVSegmentor, CVPR23
☆61Apr 22, 2024Updated last year
mlfoundations / clip_quality_not_quantity
View on GitHub
☆29Oct 18, 2022Updated 3 years ago
OVAD-Benchmark / ovad-benchmark-code
View on GitHub
OVAD: Open-vocabulary Attribute Detection code
☆31Aug 28, 2023Updated 2 years ago
ant-research / DreamLIP
View on GitHub
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
☆138May 8, 2025Updated 9 months ago
wuw2019 / R-AMT
View on GitHub
☆20Oct 19, 2023Updated 2 years ago
wangf3014 / SCLIP
View on GitHub
Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
☆182Oct 10, 2024Updated last year
ldynx / SAVE
View on GitHub
☆25Nov 22, 2024Updated last year
xinyu1205 / robust-loss-mlml
View on GitHub
Code for paper: Simple and Robust Loss Design for Multi-Label Learning with Missing Labels
☆52Apr 11, 2023Updated 2 years ago
mightyzau / RegionBLIP
View on GitHub
☆58Aug 7, 2023Updated 2 years ago
ImKeTT / ZeroGen
View on GitHub
[NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation
☆14Oct 7, 2023Updated 2 years ago
microsoft / A-CLIP
View on GitHub
Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)
☆36May 29, 2024Updated last year
vinid / neg_clip
View on GitHub
NegCLIP.
☆39Feb 6, 2023Updated 3 years ago
AlonMendelson / SGVL
View on GitHub
☆17Dec 13, 2023Updated 2 years ago
jiyounglee-0523 / VisAlign
View on GitHub
☆20Apr 23, 2024Updated last year
deepglint / ALIP
View on GitHub
[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
☆105Sep 18, 2023Updated 2 years ago
YuchenLiu98 / COMM
View on GitHub
Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
☆207Jan 8, 2025Updated last year
xmed-lab / CLIP_Surgery
View on GitHub
[Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
☆465Mar 1, 2025Updated last year
baaivision / DenseFusion
View on GitHub
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
☆159Dec 6, 2024Updated last year
x-cls / superclass
View on GitHub
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training
☆226Mar 20, 2025Updated 11 months ago
kdwonn / SaG
View on GitHub
Official repository of the "Shatter and Gather: Learning Referring Image Segmentation with Text Supervision (ICCV'23)"
☆42Jan 29, 2024Updated 2 years ago
yoon307 / CTI
View on GitHub
Official code for Class Tokens Infusion for Weakly Supervised Semantic Segmentation, CVPR2024
☆23Oct 26, 2024Updated last year
zlai0 / S-Seg
View on GitHub
☆23Jan 24, 2024Updated 2 years ago
jaeseokbyun / GRIT-VLP
View on GitHub
This is an official implementation of GRIT-VLP
☆20Aug 8, 2022Updated 3 years ago
toggle1995 / RIS-DMMI
View on GitHub
☆45Oct 3, 2023Updated 2 years ago
HKUST-LongGroup / CoMM
View on GitHub
Official repository for CoMM Dataset
☆50Dec 31, 2024Updated last year
amazon-science / prompt-pretraining
View on GitHub
Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"
☆259May 3, 2024Updated last year
mightyzau / InfMLLM
View on GitHub
☆19Dec 6, 2023Updated 2 years ago
Xujxyang / OpenTrans
View on GitHub
☆24Apr 17, 2024Updated last year
wuw2019 / LoTLIP
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
☆50Jan 14, 2025Updated last year
wysoczanska / clip_dinoiser
View on GitHub
Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.
☆275Oct 26, 2024Updated last year
jmiemirza / MATE
View on GitHub
MATE: Masked Autoencoders are Online 3D Test-Time Learners (ICCV 2023)
☆22Jul 22, 2023Updated 2 years ago
threedle / hyperfields
View on GitHub
☆22Dec 11, 2024Updated last year
KishoreP1 / DetailCLIP
View on GitHub
Detail-Oriented CLIP for Fine-Grained Tasks (ICLR SSI-FM 2025)
☆57Mar 26, 2025Updated 11 months ago
Lackel / AGLA
View on GitHub
[CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
☆61Jul 16, 2024Updated last year
xing0047 / rewrite
View on GitHub
[NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation
☆20Jan 3, 2024Updated 2 years ago
jnypark / VideoMamba
View on GitHub
☆27Jun 4, 2024Updated last year
yiren-jian / BLIText
View on GitHub
[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
☆27Dec 5, 2023Updated 2 years ago
chenxi52 / CMPF
View on GitHub
Open-Vocabulary Panoptic Segmentation
☆27Jun 15, 2025Updated 8 months ago