SPIRAL-MED / Ophiuchus
☆34 Updated last month
Alternatives and similar repositories for Ophiuchus:
Users who are interested in Ophiuchus are comparing it to the libraries listed below.
- ☆44 Updated 4 months ago
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine ☆77 Updated 4 months ago
- Dataset of paper: On the Compositional Generalization of Multimodal LLMs for Medical Imaging ☆28 Updated last month
- ☆45 Updated 4 months ago
- Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs" ☆50 Updated 3 months ago
- ☆61 Updated 8 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆89 Updated last month
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆63 Updated 8 months ago
- ☆73 Updated 11 months ago
- This is the implementation of LeCo ☆30 Updated last month
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine" ☆54 Updated 2 weeks ago
- ☆79 Updated 2 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆49 Updated 4 months ago
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ☆48 Updated 2 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models ☆40 Updated 3 months ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs ☆96 Updated last month
- [ICLR 2025] SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights ☆53 Updated last week
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models ☆77 Updated 7 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. ☆41 Updated last month
- ☆64 Updated 2 weeks ago
- ☆58 Updated 5 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆44 Updated last month
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs ☆23 Updated 4 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆41 Updated last week
- Official implementation of the paper "Process Reward Model with Q-value Rankings" ☆48 Updated 2 weeks ago
- ☆27 Updated last month
- The code and data for the paper JiuZhang3.0 ☆40 Updated 8 months ago