SooLab/DDCOT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SooLab/DDCOT)

SooLab / DDCOT

[NeurIPS 2023]DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models

☆49

Alternatives and similar repositories for DDCOT

Users that are interested in DDCOT are comparing it to the libraries listed below

Sorting:

chancharikmitra / CCoT
View on GitHub
[CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"
☆145Jun 20, 2024Updated last year
CAMMA-public / SSG-VQA
View on GitHub
[IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge
☆47May 23, 2025Updated 9 months ago
chengtan9907 / mc-cot
View on GitHub
The official implementation of the ECCV'24 paper MC-CoT: Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models w…
☆26May 19, 2024Updated last year
maximek3 / MIMIC-NLE
View on GitHub
☆21Jul 25, 2022Updated 3 years ago
zhung2 / uvtranse
View on GitHub
☆10Jun 1, 2019Updated 6 years ago
sangminwoo / RITUAL
View on GitHub
Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…
☆14Dec 16, 2024Updated last year
ExplainableML / Probabilistic_Deep_Metric_Learning
View on GitHub
This repository contains the code for our ECCV 2022 paper on our "Non-isotropic Probabilistic Take on Proxy-based Deep Metric Learning".
☆12Dec 6, 2022Updated 3 years ago
FatemehShiri / Spatial-MM
View on GitHub
☆12Jan 10, 2025Updated last year
UARK-AICV / FG-CXR
View on GitHub
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆11Jul 28, 2025Updated 7 months ago
arubique / OCCAM
View on GitHub
This is an implementation of the paper "Are We Done with Object-Centric Learning?"
☆12Sep 11, 2025Updated 5 months ago
Lackel / AGLA
View on GitHub
[CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
☆61Jul 16, 2024Updated last year
CUHK-AIM-Group / MCPL
View on GitHub
MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)
☆13Apr 17, 2024Updated last year
SooLab / REP-ERU
View on GitHub
[ECCV2022] A PyTorch implementation of the paper "Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embo…
☆13Mar 20, 2023Updated 2 years ago
ggg0919 / cantor
View on GitHub
☆91May 10, 2024Updated last year
dhg-wei / MCL
View on GitHub
(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
☆28Sep 27, 2024Updated last year
lizhaoliu-Lec / DAS
View on GitHub
This is the official repo for Densely-Anchored Sampling for Deep Metric Learning (ECCV 22).
☆16May 24, 2024Updated last year
LALBJ / PAI
View on GitHub
[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
☆164Nov 6, 2024Updated last year
synlp / R2-LLM
View on GitHub
The official GitHub repository of the AAAI-2024 paper "Bootstrapping Large Language Models for Radiology Report Generation".
☆66Apr 23, 2024Updated last year
Hxyou / IdealGPT
View on GitHub
Official Code of IdealGPT
☆35Oct 13, 2023Updated 2 years ago
bzluan / TextCoT
View on GitHub
The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.
☆44Sep 24, 2024Updated last year
junha1125 / Domain-Adaptation-Generalization-in-ECCV-2024
View on GitHub
☆16Sep 29, 2024Updated last year
thomaswei-cn / MC-CoT
View on GitHub
MC-CoT implementation code
☆22Jun 24, 2025Updated 8 months ago
ExplainableML / NonIsotropicProxyDML
View on GitHub
This repository contains the code for our CVPR 2022 paper on "Non-isotropy Regularization for Proxy-based Deep Metric Learning".
☆15Mar 10, 2023Updated 2 years ago
kdiAAA / TDA
View on GitHub
[CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models"
☆115Jul 15, 2024Updated last year
YiyangZhou / LURE
View on GitHub
[ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
☆155Apr 30, 2024Updated last year
TIGER-AI-Lab / ABC
View on GitHub
ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]
☆20Aug 21, 2025Updated 6 months ago
sky4689524 / Pytorch_AdversarialAttacks
View on GitHub
Pytorch implementation with segmentation model and adversarial attacks
☆14Oct 20, 2019Updated 6 years ago
jungao1106 / ICoT
View on GitHub
[CVPR' 25] Interleaved-Modal Chain-of-Thought
☆106Dec 30, 2025Updated 2 months ago
sanyeungwang / PML
View on GitHub
[CVPR 2021] This repository is the official implementation of "PML: Progressive Margin Loss for Long-tailed Age Classification."
☆17Mar 13, 2024Updated last year
Liqq1 / awesome-medical-vision-and-language-pretraining
View on GitHub
The collection of medical VLP papars
☆20Jul 24, 2024Updated last year
mbsariyildiz / resnet-pytorch
View on GitHub
☆18May 25, 2018Updated 7 years ago
1zhou-Wang / MemVR
View on GitHub
[ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…
☆172Sep 25, 2025Updated 5 months ago
mikecheninoulu / SMG
View on GitHub
SMG source code and dataset
☆18May 10, 2023Updated 2 years ago
philip-mueller / chex
View on GitHub
Chest X-Ray Explainer (ChEX)
☆23Jan 30, 2025Updated last year
liyongqi67 / LTRGR
View on GitHub
☆21Aug 9, 2024Updated last year
mikecheninoulu / Emotional-gesture-papers
View on GitHub
☆21May 29, 2025Updated 9 months ago
Meituan-AutoML / Lenna
View on GitHub
☆86Feb 5, 2024Updated 2 years ago
YukunLi99 / AdaptSAM
View on GitHub
☆22Jun 30, 2023Updated 2 years ago
ovguyo / captions-in-VQA
View on GitHub
Using image captions with LLM for zero-shot VQA
☆18Mar 14, 2024Updated last year