Yangyi-Chen/CoTConsistency

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Yangyi-Chen/CoTConsistency)

Yangyi-Chen / CoTConsistency

The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".

☆34

Alternatives and similar repositories for CoTConsistency

Users that are interested in CoTConsistency are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mzeeshankaramat / SafeAgents
View on GitHub
☆20Jun 4, 2026Updated last month
IST-DASLab / sparse-imagenet-transfer
View on GitHub
Code for reproducing the results in "How Well do Sparse Imagenet Models Transfer?", presented at CVPR 2022
☆10Jun 3, 2022Updated 4 years ago
HYPJUDY / Sparkles
View on GitHub
Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models
☆45Jun 14, 2024Updated 2 years ago
thunlp / LLM-generated-text-detection
View on GitHub
☆13Nov 7, 2023Updated 2 years ago
hananshafi / MedContext
View on GitHub
[MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"
☆14Nov 1, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Control-xl / Medical-Vision-Langauge-Transformer
View on GitHub
☆17Nov 1, 2023Updated 2 years ago
ramakanth-pasunuru / QmdsCnnIr
View on GitHub
☆12Mar 18, 2021Updated 5 years ago
hananshafi / MTL-ViT
View on GitHub
A new multi-task learning framework using Vision Transformers
☆11Jun 19, 2024Updated 2 years ago
dali-does / clevr-math
View on GitHub
☆13May 9, 2023Updated 3 years ago
MadryLab / bias-transfer
View on GitHub
☆15Jul 24, 2022Updated 4 years ago
rtaori / data_feedback
View on GitHub
Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"
☆18Sep 9, 2022Updated 3 years ago
jiaangli / VILA
View on GitHub
[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆16Nov 22, 2024Updated last year
gefend / LIMITR
View on GitHub
Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representation
☆17Updated this week
Yangyi-Chen / PaperList-Trustworthy-Applications
View on GitHub
Mostly recording papers about models' trustworthy applications. Intending to include topics like model evaluation & analysis, security, c…
☆21May 30, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
FreedomIntelligence / TRIM
View on GitHub
We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…
☆22Jan 11, 2026Updated 6 months ago
tianyi-lab / HallusionBench
View on GitHub
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…
☆342Oct 14, 2025Updated 9 months ago
akhtarvision / bpc_calibration
View on GitHub
[CVPR 2023] Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection
☆31Jun 21, 2023Updated 3 years ago
Timothyxxx / NeuralSymbolicPapers
View on GitHub
☆14Aug 18, 2022Updated 3 years ago
techmn / cosnet
View on GitHub
A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes (WACV 2025)
☆12Aug 11, 2025Updated 11 months ago
yushundong / Fairness-must-read-list
View on GitHub
Papers on fairness
☆12Oct 20, 2020Updated 5 years ago
HashmatShadab / Robustness-of-Volumetric-Medical-Segmentation-Models
View on GitHub
[BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models
☆15Nov 1, 2024Updated last year
FuxiaoLiu / DocumentCLIP
View on GitHub
[ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents
☆16Apr 4, 2024Updated 2 years ago
ilkerkesen / ViLMA
View on GitHub
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)
☆16Jan 18, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ShahinaKK / LWI-VMS
View on GitHub
Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]
☆22Oct 27, 2024Updated last year
Hasindri / HLSS
View on GitHub
[MICCAI 2024 🔥] HLSS, the first study to explore hierarchical information inherent in histopathology images and their language descripti…
☆27Aug 5, 2024Updated last year
allenai / sherlock
View on GitHub
Code, data, models for the Sherlock corpus
☆62Nov 11, 2022Updated 3 years ago
wuxiyang1996 / AutoHallusion
View on GitHub
AutoHallusion Codebase (EMNLP 2024)
☆23Dec 6, 2024Updated last year
allenai / multimodalqa
View on GitHub
☆158Oct 12, 2022Updated 3 years ago
junyangwang0410 / HaELM
View on GitHub
An automatic MLLM hallucination detection framework
☆19Sep 26, 2023Updated 2 years ago
tyshiwo1 / Awesome-Visual-Tokenizer
View on GitHub
Awesome Visual Tokenizers/Autoencoders
☆20Nov 19, 2025Updated 8 months ago
shikras / shikra
View on GitHub
☆814Jul 8, 2024Updated 2 years ago
alvinliu0 / Visual-Sound-Localization-in-the-Wild
View on GitHub
Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).
☆29Feb 15, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
BioMedIA-MBZUAI / CoReEcho
View on GitHub
☆16Sep 23, 2024Updated last year
mbzuai-oryx / CVRR-Evaluation-Suite
View on GitHub
[CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo…
☆50Aug 23, 2024Updated last year
GasolSun36 / MVP
View on GitHub
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
☆24Sep 9, 2024Updated last year
zhaoyucs / VSD
View on GitHub
Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"
☆25Mar 9, 2024Updated 2 years ago
cg1177 / Recursive-Multimodal-Agent
View on GitHub
☆19Jul 1, 2026Updated 3 weeks ago
xiye17 / SketchRegex
View on GitHub
Sketch Driven Regular Expression Generation.
☆17Apr 26, 2023Updated 3 years ago
eth-sri / smoothing-ensembles
View on GitHub
[ICLR 2022] Boosting Randomized Smoothing with Variance Reduced Classifiers
☆11Mar 29, 2022Updated 4 years ago