code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"
☆19Mar 10, 2025Updated 11 months ago
Alternatives and similar repositories for CoMT
Users that are interested in CoMT are comparing it to the libraries listed below
Sorting:
- Collection of papers, benchmarks and newest trends in the domain of End-to-end ToDs☆14Nov 18, 2023Updated 2 years ago
- ☆88Jun 7, 2024Updated last year
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated last month
- Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models☆278Aug 5, 2025Updated 7 months ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.☆31Aug 7, 2025Updated 6 months ago
- we explores the fascinating domain of text-to-image generation using the powerful capabilities of the Flux API. The objective is to trans…☆12Aug 14, 2024Updated last year
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆53Sep 29, 2025Updated 5 months ago
- ☆21Aug 8, 2025Updated 6 months ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 3 months ago
- Code for training and evaluation on the "Industrial Language-Image Dataset (ILID)".☆10Jun 4, 2025Updated 9 months ago
- ☆41Apr 29, 2024Updated last year
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information☆13Oct 1, 2024Updated last year
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆20Jul 29, 2025Updated 7 months ago
- PyTorch implementation of FAIR's paper "End-to-End Memory Network", NIPS 2015☆12Oct 19, 2017Updated 8 years ago
- 哈尔滨工业大学 软件架构与中间件 实验 2022春☆10Sep 21, 2023Updated 2 years ago
- [ICML 2024 Spotlight] "Sample-specific Masks for Visual Reprogramming-based Prompting"☆12Dec 20, 2024Updated last year
- This is an official implementation in PyTorch of PTH-Net: Dynamic Facial Expression Recognition without Face Detection and Alignment..☆13Jul 1, 2025Updated 8 months ago
- init☆11May 25, 2025Updated 9 months ago
- [ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation☆11Aug 13, 2024Updated last year
- [NeurIPS 2025] This is the official repository for "RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis"☆26Nov 21, 2025Updated 3 months ago
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆21Jan 27, 2026Updated last month
- CRNN with Self-Attention☆10Apr 8, 2018Updated 7 years ago
- ☆21Feb 13, 2026Updated 3 weeks ago
- ☆16Oct 12, 2025Updated 4 months ago
- ☆12Feb 9, 2025Updated last year
- ☆13Dec 2, 2024Updated last year
- Sequence Labeling Parsing by Learning Across Representations☆13Oct 3, 2019Updated 6 years ago
- A paddle implementation of "Masked Autoencoders Are Scalable Vision Learners"☆11Feb 24, 2022Updated 4 years ago
- [ACL 2025] RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios☆23Jul 2, 2025Updated 8 months ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos☆12Sep 24, 2024Updated last year
- ☆13Sep 26, 2025Updated 5 months ago
- This repository contains the implementation for Anomaly Detection using Score-based Perturbation Resilience (ICCV 2023)☆14Sep 6, 2024Updated last year
- [NeurIPS 2023] "Learning to Augment Distributions for Out-of-distribution Detection"☆12Nov 14, 2023Updated 2 years ago
- ☆12Oct 9, 2018Updated 7 years ago
- ☆10Apr 22, 2019Updated 6 years ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆22Feb 13, 2025Updated last year
- Library for implementing RNNs with Theano☆11Mar 26, 2015Updated 10 years ago
- [Accepted by TGRS2022] Official code of the paper "Multi-branch Feature Difference Learning Network for Cross-Spectral Image Patch Matchi…☆13Feb 9, 2025Updated last year