Yijia-Xiao / LogicVistaLinks
☆16Updated last year
Alternatives and similar repositories for LogicVista
Users that are interested in LogicVista are comparing it to the libraries listed below
Sorting:
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Updated last year
- ☆14Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024)☆19Updated 7 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆51Updated last year
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆58Updated last year
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆53Updated 4 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated 8 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆69Updated last year
- ☆11Updated last year
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆74Updated 3 months ago
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning☆20Updated last year
- text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)☆12Updated last year
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆84Updated 3 months ago
- [NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method☆92Updated 2 months ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆47Updated 7 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆60Updated 6 months ago
- A hot-pluggable tool for visualizing LLaVA's attention.☆24Updated 2 years ago
- Collection of latest papers and materials in the area of RLVR!☆52Updated this week
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆40Updated 8 months ago
- Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs☆45Updated 7 months ago
- Official Repository of LatentSeek☆76Updated 7 months ago
- [NeurIPS 2025] Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models☆60Updated 2 months ago
- Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)☆68Updated 8 months ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆20Updated 8 months ago
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"☆96Updated 2 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆46Updated last year
- ICLR 2025☆30Updated 8 months ago
- GRPO Algorithm for Llava Architecture (Based on Verl)☆47Updated 8 months ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"☆110Updated last month
- ☆60Updated last year