Yijia-Xiao / LogicVistaLinks
☆16Updated last year
Alternatives and similar repositories for LogicVista
Users that are interested in LogicVista are comparing it to the libraries listed below
Sorting:
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024)☆17Updated 6 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆58Updated last year
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning☆19Updated last year
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆40Updated 7 months ago
- text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)☆11Updated last year
- Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)☆67Updated 8 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated 8 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆69Updated last year
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆50Updated last year
- Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs☆45Updated 6 months ago
- Official Repository of LatentSeek☆73Updated 7 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆84Updated last year
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆52Updated 3 months ago
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆53Updated 6 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆84Updated 2 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆60Updated 6 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆53Updated 9 months ago
- GRPO Algorithm for Llava Architecture (Based on Verl)☆45Updated 8 months ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆73Updated 2 months ago
- ☆109Updated last year
- ☆24Updated 6 months ago
- ☆60Updated 2 weeks ago
- A collection of awesome think with videos papers.☆76Updated last month
- A paper list of Awesome Latent Space.☆276Updated last week
- AdaMoLE: Adaptive Mixture of LoRA Experts☆38Updated last year
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆92Updated last year
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"☆96Updated last month
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆87Updated 10 months ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆81Updated 3 weeks ago