HITsz-TMG / VisionGraph
The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context"
☆13 · Updated 7 months ago
Alternatives and similar repositories for VisionGraph:
Users who are interested in VisionGraph are comparing it to the repositories listed below.
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models ☆33 · Updated 7 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." ☆38 · Updated 2 months ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models". ☆32 · Updated last year
- A unified platform for prompt engineering in large language models (LLMs). ☆11 · Updated this week
- An Easy-to-use Hallucination Detection Framework for LLMs. ☆55 · Updated 8 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆33 · Updated last year
- MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆32 · Updated last month
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024) ☆40 · Updated 2 months ago
- Code and Data Repo for [NeurIPS 2024] Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning" ☆21 · Updated 7 months ago
- A trainable user simulator ☆32 · Updated 4 months ago
- Mosaic IT: Enhancing Instruction Tuning with Data Mosaics ☆17 · Updated 6 months ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space' ☆11 · Updated 5 months ago
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs ☆22 · Updated 3 months ago
- Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?" ☆41 · Updated last month
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆44 · Updated 3 weeks ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent." ☆43 · Updated 2 months ago
- Visual and Embodied Concepts evaluation benchmark ☆21 · Updated last year
- MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale ☆27 · Updated last month
- ✨✨ The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio ☆38 · Updated 3 months ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs" ☆73 · Updated last month
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ☆38 · Updated 2 months ago
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?" ☆28 · Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment ☆13 · Updated 3 weeks ago
- A Self-Training Framework for Vision-Language Reasoning ☆60 · Updated 2 months ago