DoubtedSteam / MM-GCoTLinks
The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"
☆13Updated 2 weeks ago
Alternatives and similar repositories for MM-GCoT
Users that are interested in MM-GCoT are comparing it to the libraries listed below
Sorting:
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆33Updated 8 months ago
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆20Updated 9 months ago
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs