ExplainableML/flair

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ExplainableML/flair)

ExplainableML / flair

[CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations

☆147

Alternatives and similar repositories for flair

Users that are interested in flair are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ExplainableML / cosmos
View on GitHub
[CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
☆42Mar 27, 2025Updated last year
ExplainableML / finer
View on GitHub
[CVPR 2026 Oral] FINER: MLLMs Hallucinate under Fine-grained Negative Queries
☆17Jul 6, 2026Updated 2 weeks ago
tiiuae / FineLIP
View on GitHub
code for FineLIP
☆43Nov 25, 2025Updated 7 months ago
wuw2019 / LoTLIP
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
☆49Jan 14, 2025Updated last year
lezhang7 / SAIL
View on GitHub
[CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"
☆60Aug 15, 2025Updated 11 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ExplainableML / DeLoRA
View on GitHub
[ICLR25] Official Implementation of "Decoupling Angles and Strength in Low-rank Adaptation"
☆15Dec 12, 2025Updated 7 months ago
ExplainableML / TiViT
View on GitHub
Time Vision Transformer
☆24Jul 20, 2025Updated last year
ant-research / DreamLIP
View on GitHub
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
☆138May 8, 2025Updated last year
flrneha / ElasticBasisForSpectralMatching
View on GitHub
Accompanying code for "An Elastic Basis for Spectral Shape Correspondence"
☆12Aug 2, 2023Updated 2 years ago
arijitray1993 / COLA
View on GitHub
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25May 14, 2026Updated 2 months ago
ExplainableML / DeViL
View on GitHub
GCPR 2023 - DeViL: Decoding Vision features into Language
☆12Oct 16, 2023Updated 2 years ago
miccunifi / Cross-the-Gap
View on GitHub
[ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion
☆69Nov 30, 2025Updated 7 months ago
m1k2zoo / negbench
View on GitHub
Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"
☆47Feb 26, 2026Updated 4 months ago
rabiulcste / vismin
View on GitHub
[NeurIPS24] VisMin: Visual Minimal-Change Understanding
☆19Mar 3, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Becomebright / MTV
View on GitHub
Revisiting Multi-Task Visual Representation Learning
☆22Jan 21, 2026Updated 6 months ago
xjjxmu / TextRefiner
View on GitHub
The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]
☆53Mar 13, 2025Updated last year
HanSolo9682 / CounterCurate
View on GitHub
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19Jun 27, 2024Updated 2 years ago
zhuole1025 / LLMs_as_Visual_Explainers
View on GitHub
Official Repository for "LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions"
☆15Apr 20, 2025Updated last year
rootyJeon / Vision-aligned-Latent-Reasoning
View on GitHub
[ICML 2026] Official implementation of Vision-aligned Latent Reasoning for Multi-modal Large Language Model (VaLR)
☆20Apr 30, 2026Updated 2 months ago
Mid-Push / SmartCLIP
View on GitHub
SmartCLIP: A training method to improve CLIP with both short and long texts
☆43Jun 18, 2025Updated last year
microsoft / klite
View on GitHub
[NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222
☆54Jun 12, 2023Updated 3 years ago
wangguanan / DenoiseRep
View on GitHub
[NeurIPS2024 Oral] PyTorch implementation of DenoiseRep
☆35Sep 23, 2025Updated 9 months ago
ytaek-oh / fsc-clip
View on GitHub
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
☆22Oct 8, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ASGMVLP / ASGMVLP_CODE
View on GitHub
The repo of ASGMVLP
☆19Jan 16, 2026Updated 6 months ago
RAIVNLab / CREPE
View on GitHub
[CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?
☆35Apr 27, 2023Updated 3 years ago
ExplainableML / Vision_by_Language
View on GitHub
[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
☆89Jul 4, 2024Updated 2 years ago
vec-ai / wikiHow-TIIR
View on GitHub
[ACL 2025] Towards Text-Image Interleaved Retrieval
☆16Sep 3, 2025Updated 10 months ago
worldbench / SPIRAL
View on GitHub
[NeurIPS 2025] SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation and Understanding
☆44Jul 8, 2026Updated last week
emu1729 / GIST
View on GitHub
Generating Image Specific Text
☆29Aug 14, 2023Updated 2 years ago
MIV-XJTU / FLAME
View on GitHub
[CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"
☆33Jul 8, 2025Updated last year
yuanqing-ai / LLM-Hierarchical-Consistency
View on GitHub
Official implementation of "Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck"
☆16Nov 10, 2025Updated 8 months ago
guanjinquan / CXRTrek
View on GitHub
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning, release the dataset and the model weight
☆13May 26, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
hmchuong / CoLLM
View on GitHub
[CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval
☆28Mar 26, 2025Updated last year
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
altndrr / vic
View on GitHub
Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification
☆107Feb 2, 2024Updated 2 years ago
ErgastiAlex / MARS
View on GitHub
☆37Mar 28, 2025Updated last year
dongliangcao / Spectral-Meets-Spatial
View on GitHub
CVPR24: Spectral Meets Spatial: Harmonising 3D Shape Matching and Interpolation
☆19Jul 4, 2024Updated 2 years ago
HKU-MedAI / HERGen
View on GitHub
[ECCV'2024] HERGen: Elevating Radiology Report Generation with Longitudinal Data
☆31Jan 25, 2026Updated 5 months ago
ExplainableML / EgoCVR
View on GitHub
[ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
☆41Apr 11, 2025Updated last year