cheliu-computation / Med-R1-Alpha
Unleashing Reasoning in Medical Large Language Models
☆11Updated last month
Alternatives and similar repositories for Med-R1-Alpha:
Users that are interested in Med-R1-Alpha are comparing it to the libraries listed below
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆86Updated last year
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆39Updated 6 months ago
- ✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?☆111Updated last month
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆136Updated last month
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.☆65Updated 4 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆24Updated last month
- Code release for VTW (AAAI 2025) Oral☆34Updated 3 months ago
- LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models☆126Updated 11 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆162Updated 3 months ago
- [EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…☆92Updated 5 months ago
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency☆100Updated 3 weeks ago
- [CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"☆33Updated last week
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆298Updated 4 months ago
- Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆22Updated 2 weeks ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆123Updated 10 months ago
- ☆115Updated 8 months ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆139Updated 2 months ago
- [CVPR 2024] The official pytorch implementation of "A General and Efficient Training for Transformer via Token Expansion".☆44Updated last year
- A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.☆56Updated last month
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆234Updated last year
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training☆35Updated last month
- Visual Instruction Tuning for Qwen2 Base Model☆32Updated 9 months ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆86Updated 3 weeks ago
- Description for MV-MATH☆12Updated last month
- MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants☆29Updated 3 months ago
- Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆43Updated 2 weeks ago
- [NeurIPS 2024] MoVA: Adapting Mixture of Vision Experts to Multimodal Context☆154Updated 7 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆50Updated 3 months ago
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆63Updated last month
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"☆325Updated 2 weeks ago