FreedomIntelligence / MedGenLinks
MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.
☆30Updated 6 months ago
Alternatives and similar repositories for MedGen
Users that are interested in MedGen are comparing it to the libraries listed below
Sorting:
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆38Updated 8 months ago
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆26Updated 9 months ago
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning☆111Updated 3 months ago
- MedEvalKit: A Unified Medical Evaluation Framework☆208Updated 3 months ago
- [ACM MM 2025] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"☆103Updated last month
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆60Updated 4 months ago
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.☆81Updated last year
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated 8 months ago
- ☆48Updated 11 months ago
- Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models☆102Updated 6 months ago
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆26Updated last year
- Citrus-V: Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning☆18Updated 4 months ago
- [ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning☆46Updated last month
- A virtual clinical environment for self‑evolving LLM diagnostic agents.☆92Updated 2 months ago
- [ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆38Updated 4 months ago
- ☆42Updated 6 months ago
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆149Updated 3 months ago
- [ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models☆46Updated last year
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆48Updated last month
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆52Updated 6 months ago
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆25Updated 11 months ago
- The official code for MedAgent_Pro☆94Updated 5 months ago
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning☆37Updated 9 months ago
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆27Updated 3 months ago
- This repository is aim to reproduce the R1-Zero on medical domain.☆32Updated 7 months ago
- [NeurIPS 2024] Efficient Large Multi-modal Models via Visual Context Compression☆64Updated 11 months ago
- The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data☆64Updated 2 years ago
- DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue☆64Updated 2 weeks ago
- CLIP-MoE: Mixture of Experts for CLIP☆55Updated last year
- ☆60Updated 2 months ago