linzhiqiu/CLIP-FlanT5

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/linzhiqiu/CLIP-FlanT5)

linzhiqiu / CLIP-FlanT5

Training code for CLIP-FlanT5

☆31

Alternatives and similar repositories for CLIP-FlanT5

Users that are interested in CLIP-FlanT5 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

linzhiqiu / t2v_metrics
View on GitHub
Evaluating text-to-image/video/3D models with VQAScore
☆598Jun 5, 2026Updated last month
linzhiqiu / visual_gpt_score
View on GitHub
VisualGPTScore for visio-linguistic reasoning
☆27Oct 7, 2023Updated 2 years ago
google-deepmind / geckonum_benchmark_t2i
View on GitHub
GeckoNum Benchmark for T2I Model Eval.
☆15Dec 5, 2024Updated last year
HanSolo9682 / CounterCurate
View on GitHub
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19Jun 27, 2024Updated 2 years ago
casiatao / LPO
View on GitHub
The official pytorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”.
☆19May 22, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
McGill-NLP / diffusion-itm
View on GitHub
Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"
☆33Mar 15, 2024Updated 2 years ago
paulgavrikov / biases_vs_generalization
View on GitHub
Official code for the CVPR 2024 Paper "Can Biases in ImageNet Models Explain Generalization?".
☆13Jun 24, 2024Updated 2 years ago
BeyondScene / BeyondScene
View on GitHub
[ECCV 2024] BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion
☆21Jul 2, 2024Updated 2 years ago
vl-rewardbench / VL_RewardBench
View on GitHub
☆29Jul 23, 2025Updated last year
kaist-ami / BEAF
View on GitHub
[ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"
☆22Mar 26, 2025Updated last year
daeunni / VideoRepair
View on GitHub
Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"
☆52Apr 7, 2026Updated 3 months ago
patrick-tssn / VideoHallucer
View on GitHub
VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)
☆43Dec 16, 2025Updated 7 months ago
google-deepmind / svo_probes
View on GitHub
The SVO-Probes Dataset for Verb Understanding
☆29Jan 28, 2022Updated 4 years ago
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Sep 30, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HuiGuanLab / RaTSG
View on GitHub
This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"
☆13Aug 22, 2025Updated 11 months ago
TIGER-AI-Lab / VIEScore
View on GitHub
Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…
☆68Nov 19, 2024Updated last year
ajd12342 / why-winoground-hard
View on GitHub
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
☆31May 29, 2023Updated 3 years ago
adobe-research / llava-score
View on GitHub
☆11Oct 2, 2024Updated last year
ytaek-oh / vl_compo
View on GitHub
☆10Jul 5, 2024Updated 2 years ago
songrise / MLLM4Art
View on GitHub
[ACM MM 2025] MLLMs for Aesthetics Reasoning
☆26Jan 5, 2026Updated 6 months ago
jiaangli / VILA
View on GitHub
[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆16Nov 22, 2024Updated last year
gemlab-vt / CONFORM
View on GitHub
Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models [CVPR 2024]
☆27Oct 7, 2024Updated last year
Davidelanz / pytorch-hed
View on GitHub
Python Package reimplementation of Holistically-Nested Edge Detection in PyTorch
☆12Jan 5, 2021Updated 5 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
mlfoundations / VisIT-Bench
View on GitHub
☆51Oct 29, 2023Updated 2 years ago
hucvl / craft
View on GitHub
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions
☆16Jun 10, 2021Updated 5 years ago
Baiqi-Li / NaturalBench
View on GitHub
🚀 [NeurIPS24] Make Vision Matter in Visual-Question-Answering (VQA)! Introducing NaturalBench, a vision-centric VQA benchmark (NeurIPS'2…
☆90Jun 24, 2025Updated last year
RAIVNLab / sugar-crepe
View on GitHub
[NeurIPS 2023] A faithful benchmark for vision-language compositionality
☆94Feb 13, 2024Updated 2 years ago
ElvishElvis / LCA-on-the-line
View on GitHub
LCA-on-the-line (ICML 2024 Oral)
☆14Feb 13, 2025Updated last year
Lookuz / VidHal
View on GitHub
Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs
☆14Apr 23, 2026Updated 3 months ago
Yui010206 / MEXA
View on GitHub
[EMNLP 2025 Findings] MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
☆15Aug 22, 2025Updated 11 months ago
LgQu / TIGeR
View on GitHub
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16Jul 19, 2026Updated last week
naver-ai / prolip
View on GitHub
☆58Aug 16, 2025Updated 11 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
kongdai123 / consistency2
View on GitHub
☆16Jun 14, 2024Updated 2 years ago
TobiasLee / VEC
View on GitHub
Visual and Embodied Concepts evaluation benchmark
☆21Oct 10, 2023Updated 2 years ago
clarken92 / DisentanglementMetrics
View on GitHub
This repository contains the full code for our paper "Theory and evaluation metrics for learning disentangled representations"
☆15Feb 13, 2021Updated 5 years ago
RifleZhang / LLaVA-Hound-DPO
View on GitHub
☆158Oct 31, 2024Updated last year
yonatanbitton / wysiwyr
View on GitHub
☆37Oct 7, 2023Updated 2 years ago
T-Lab-CUHKSZ / G2RPO-A
View on GitHub
[ACL 2026] G2RPO-A: Guided Group Relative Policy Optimization with Adaptive Guidance
☆16May 20, 2026Updated 2 months ago
yeppp27 / VisualScore
View on GitHub
☆21May 28, 2026Updated 2 months ago