SPIRAL-MED / Ophiuchus
☆36Updated 3 months ago
Alternatives and similar repositories for Ophiuchus:
Users who are interested in Ophiuchus are comparing it to the libraries listed below
- ☆48Updated last month
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆24Updated last week
- Dataset of paper: On the Compositional Generalization of Multimodal LLMs for Medical Imaging☆32Updated 3 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"☆61Updated this week
- Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs"☆52Updated 5 months ago
- SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆84Updated this week
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆99Updated last month
- Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models?☆20Updated last month
- ☆54Updated last month
- A Self-Training Framework for Vision-Language Reasoning☆76Updated 2 months ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆106Updated last week
- ☆107Updated 3 weeks ago
- ☆73Updated 3 months ago
- [npj digital medicine] The official code for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆58Updated 2 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Updated last year
- This is the implementation of LeCo☆32Updated 3 months ago
- The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data☆60Updated last year
- ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration☆30Updated 3 months ago
- ☆38Updated last month
- ☆71Updated 10 months ago
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.☆65Updated 4 months ago
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025☆18Updated 2 months ago
- Official Code of IdealGPT☆35Updated last year
- A benchmark for evaluating the capabilities of large vision-language models (LVLMs)☆46Updated last year
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆81Updated 6 months ago
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆23Updated 6 months ago
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs☆10Updated 2 weeks ago
- MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding☆54Updated last month
- ☆73Updated last year
- ☆55Updated 6 months ago