DoubtedSteam/MM-GCoT

DoubtedSteam / MM-GCoT

The official implementation of "Grounded Chain-of-Thought for Multimodal Large Language Models"

☆22

Alternatives and similar repositories for MM-GCoT

Users that are interested in MM-GCoT are comparing it to the libraries listed below.

FYYDCC / IVT-LR
Official repository for "Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space"
☆18 · Jan 27, 2026 · Updated 5 months ago
patrick-0817 / T-MASS-dataleakage
☆10 · Nov 27, 2024 · Updated last year
maifoundations / GCoT
Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation
☆15 · Aug 11, 2025 · Updated 11 months ago
shawnricecake / Heima
[ICML 2026] Heima
☆75 · May 20, 2026 · Updated 2 months ago
ChenyuHeidiZhang / VL-commonsense
☆14 · May 23, 2022 · Updated 4 years ago
ghchen18 / acl23_mclip
The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'
☆10 · Jan 23, 2024 · Updated 2 years ago
RUCAIBox / LMM-Searcher
The official code of "Towards Long-horizon Agentic Multimodal Search"
☆27 · Apr 17, 2026 · Updated 3 months ago
yunfanLu / Self-EvRSVFI
[IEEE TVCG 2025] Self-supervised Learning of Event-guided Video Frame Interpolation for Rolling Shutter Frames
☆11 · Jun 1, 2025 · Updated last year
Longin-Yu / ComRoPE
☆11 · Jun 11, 2025 · Updated last year
saccharomycetes / mllms_know
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
☆382 · Apr 20, 2025 · Updated last year
farewellthree / Causal-Context-Debiasing
CCD: Official PyTorch implementation of the paper "Contextual Debiasing for Visual Recognition with Causal Mechanisms"
☆17 · Jan 26, 2023 · Updated 3 years ago
THUNLP-MT / ActiView
☆11 · Dec 20, 2024 · Updated last year
IndigoPurple / CUBE
Code and data of "Controllable Unsupervised Event-based Video Generation" (accepted as ICIP oral and invited by WACV workshop)
☆19 · Nov 5, 2024 · Updated last year
Hansxsourse / VRMDiff
☆11 · Mar 11, 2025 · Updated last year
zhifanzhu / jcat
jcat (jupyter cat) is a command line tool for viewing notebook (*.ipynb) files in terminal.
☆10 · Sep 17, 2022 · Updated 3 years ago
jylins / hourllava
[NeurIPS 2025 Spotlight] Unleashing Hour-Scale Video Training for Long Video-Language Understanding
☆19 · Jun 24, 2025 · Updated last year
huangzicheng / CornerNet-Lite
training for VOC dataset
☆11 · Nov 7, 2019 · Updated 6 years ago
TerminologyHub / termhub-in-5-minutes
Developer project for getting basic API integrations working in under 5 minutes
☆11 · May 22, 2026 · Updated 2 months ago
Yovecent / UDM-GRPO
[ICML 2026 Spotlight] UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models
☆27 · May 1, 2026 · Updated 2 months ago
yandex-research / graphland
GraphLand: Evaluating Graph Machine Learning Models on Diverse Industrial Data
☆37 · Apr 8, 2026 · Updated 3 months ago
cs-holder / Reasoning-Self-Evolution-Survey
☆54 · Mar 6, 2025 · Updated last year
apsk14 / semantic_scene_representations
Official Pytorch implementation of Semantic Implicit Neural Scene Representations with Semi-Supervised Training
☆13 · Jan 3, 2022 · Updated 4 years ago
BRZ911 / Wrong-of-Thought
[EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information
☆13 · Oct 1, 2024 · Updated last year
BoomShakaY / causal-vision-nlp
☆23 · Jul 30, 2023 · Updated 2 years ago
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆189 · Jun 5, 2025 · Updated last year
A-suozhang / CodedVTR
code of [CVPR22] CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance
☆18 · Jul 10, 2022 · Updated 4 years ago
AntResearchNLP / ViLaSR
[NeurIPS 2025] Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
☆98 · Jul 27, 2025 · Updated 11 months ago
KHao123 / LaSe-E2V
The source code for "LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction"
☆10 · Jul 5, 2024 · Updated 2 years ago
SpeechEE / SpeechEE
☆11 · Aug 20, 2025 · Updated 11 months ago
Haochen-Wang409 / TreeVGR
[ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology
☆92 · Jan 26, 2026 · Updated 5 months ago
michaelxuzhi / oneNiceUIPage
A good-looking UI page built with Vue + Element UI. It contains no JS code for now and serves only as a UI showcase.
☆11 · Feb 4, 2023 · Updated 3 years ago
WayneTomas / Artemis
This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning".
☆15 · Dec 4, 2025 · Updated 7 months ago
suoych / KEDs
Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)
☆20 · Nov 4, 2024 · Updated last year
ECNU-Cross-Innovation-Lab / Mamba-Spike
Mamba-Spike (CGI2024)
☆14 · Dec 3, 2025 · Updated 7 months ago
anvo25 / vlms-are-biased
Vision Language Models are Biased
☆114 · Jan 26, 2026 · Updated 5 months ago
xiaominli1020 / ReNeg
ReNeg: Learning Negative Embedding with Reward Guidance
☆35 · Dec 22, 2025 · Updated 7 months ago
mengzaiqiao / awesome-natural-language-reasoning
A collection of research papers related to Natural Language Reasoning
☆10 · May 27, 2022 · Updated 4 years ago
IDEA-Research / V-Reflection
Related code, checkpoints and project page for V-Reflection
☆60 · Apr 7, 2026 · Updated 3 months ago
cyizhuo / CIFAR-100-dataset
CIFAR-100 dataset by classes folder
☆11 · Nov 7, 2024 · Updated last year