weijingxuan / COCO-MMR
☆11Updated last year
Alternatives and similar repositories for COCO-MMR
Users interested in COCO-MMR are comparing it to the libraries listed below:
- ☆20Updated 10 months ago
- [ICML 2024] VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling☆10Updated 11 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆28Updated 9 months ago
- The official implementation of the ECCV'24 paper MC-CoT: Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models w…☆24Updated last year
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025)☆50Updated 9 months ago
- Official Implementation (Pytorch) of the "LLaMo: Large Language Model-based Molecular Graph Assistant", NeurIPS 2024☆31Updated 6 months ago
- [🏆AAAI2025] Official Repo for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area.☆50Updated 3 weeks ago
- ☆69Updated this week
- Code for AAAI24 paper Text-Guided Molecule Generation with Diffusion Language Model☆28Updated 2 months ago
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆42Updated 4 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆80Updated last week
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆23Updated 7 months ago
- GRPO Algorithm for Llava Architecture (Based on Verl)☆37Updated 3 months ago
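- A learning roadmap for newcomers to the multimodal field, covering the field's classic papers, projects, and courses. It aims to help learners build a reasonably deep understanding of the field within a set period of time and become able to carry out independent research.☆22Updated last year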
- ☆31Updated last year
- Awesome Long-CoT Data☆17Updated 5 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆79Updated 9 months ago
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆46Updated last month
- Code for DeCo: Decoupling token compression from semantic abstraction in multimodal large language models☆67Updated last month
- ☆46Updated this week
- Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main…☆11Updated 5 months ago
- The official implementation of the CVPR'2022 paper Hyperspherical Consistency Regularization.☆29Updated 3 years ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆69Updated last month
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models☆149Updated last year
- ☆17Updated last year
- ☆12Updated 4 months ago
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".☆47Updated last month
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection☆39Updated 5 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆55Updated 10 months ago
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆61Updated this week