IDEA-FinAI / ChartMoE
☆18 Updated this week
Alternatives and similar repositories for ChartMoE:
Users who are interested in ChartMoE are comparing it to the libraries listed below.
- ☆34 Updated this week
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models ☆28 Updated 5 months ago
- ☆23 Updated 7 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆48 Updated last month
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆61 Updated 6 months ago
- Official implementation of MIA-DPO ☆45 Updated last month
- MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities. ☆73 Updated 2 months ago
- A Self-Training Framework for Vision-Language Reasoning ☆40 Updated last month
- The official code of the paper "PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction". ☆46 Updated last month
- The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo… ☆57 Updated last week
- [NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of… ☆105 Updated 3 weeks ago
- Code release for VTW (AAAI 2025) ☆27 Updated last week
- An Easy-to-use Hallucination Detection Framework for LLMs. ☆48 Updated 7 months ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context" ☆28 Updated 5 months ago
- MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models ☆55 Updated 3 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models ☆72 Updated 5 months ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format. ☆21 Updated 9 months ago
- ☆87 Updated 11 months ago
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio ☆35 Updated 2 months ago
- Official repository of MMDU dataset ☆78 Updated 2 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning ☆56 Updated 3 weeks ago
- ☆38 Updated 4 months ago
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria ☆56 Updated 2 months ago
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio… ☆19 Updated 5 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced … ☆44 Updated last month
- This is the official repo for ByteVideoLLM/Dynamic-VLM ☆16 Updated this week
- MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆22 Updated last week
- Making LLaVA Tiny via MoE-Knowledge Distillation ☆70 Updated last month
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models" ☆90 Updated 5 months ago
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization ☆73 Updated 10 months ago