Quinn777 / AtomThink
☆51 Updated last month
Alternatives and similar repositories for AtomThink:
Users that are interested in AtomThink are comparing it to the libraries listed below
- MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities. ☆78 Updated 3 months ago
- A Self-Training Framework for Vision-Language Reasoning ☆60 Updated 2 months ago
- ☆57 Updated 7 months ago
- ☆49 Updated last week
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models ☆76 Updated 6 months ago
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models ☆22 Updated 4 months ago
- ☆44 Updated 3 months ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models" ☆103 Updated 6 months ago
- ☆94 Updated last year
- ☆73 Updated 10 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024) ☆40 Updated 2 months ago
- The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo… ☆67 Updated last month
- [NeurIPS 2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models" ☆136 Updated this week
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆55 Updated 2 months ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models ☆33 Updated 7 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ☆62 Updated last month
- ☆59 Updated 11 months ago
- ICML'2024 | MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI ☆96 Updated 6 months ago
- ☆27 Updated 2 weeks ago
- An RLHF Infrastructure for Vision-Language Models ☆145 Updated 2 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations ☆63 Updated 6 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆68 Updated this week
- An Easy-to-use Hallucination Detection Framework for LLMs. ☆55 Updated 8 months ago
- [AAAI 2025] Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning ☆24 Updated 3 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning ☆61 Updated last month
- A benchmark for evaluating the capabilities of large vision-language models (LVLMs) ☆42 Updated last year
- [NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of… ☆109 Updated last month
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context" ☆28 Updated 6 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆62 Updated 7 months ago
- Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal … ☆38 Updated last month