manglu097 / Chiron-o1Links
[NIPS 2025] Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative Search
☆71Updated 3 months ago
Alternatives and similar repositories for Chiron-o1
Users that are interested in Chiron-o1 are comparing it to the libraries listed below
Sorting:
- [AAAI-2026] Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs via Reinforcement Learning☆45Updated 2 months ago
- CausalVLR: A Toolbox and Benchmark for Vision-Language Causal Reasoning (多模态因果推理开源框架)☆1,152Updated 3 months ago
- Embodied Intelligence in Endovascular Robot Navigation -- 血管介入手术机器人具身导航☆111Updated 2 weeks ago
- R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization☆452Updated last month
- ☆160Updated last week
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?".☆160Updated 2 months ago
- ☆34Updated 9 months ago
- An official implementation of PCRLv2 (pre-training and fine-tuning code are included).☆113Updated 2 years ago
- ☆120Updated 2 months ago
- LA-ViT: A Network with Transformers constrained by Learned-parameters-free Attention for Interpretable Grading in A New Laryngeal Histopa…☆20Updated 7 months ago
- JarvisX-Cowork: Your First Personal AI Creative Assistant for Everyone!☆79Updated this week
- [AAAI-2026] Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner☆92Updated 2 months ago
- codebase for iccv 2025 paper "One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory"☆126Updated 5 months ago
- [IEEE TASE 2025] The Official Implementation for ''Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Clo…☆108Updated 2 weeks ago
- TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based …☆843Updated 2 months ago
- Empowering MLLM for Grounded ECG Understanding with Time Series and Images [NeurIPS 2025]☆253Updated last month
- ☆100Updated last month
- [ICLR 2025] CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs☆130Updated 7 months ago
- ☆120Updated 3 years ago
- [NeurIPS 2025] Native-resolution diffusion Transformer☆291Updated 3 months ago
- ☆398Updated 2 years ago
- ☆169Updated 5 months ago
- [CVPR‘ 2025 ] JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration☆250Updated last month
- PyGDA is a Python library for Graph Domain Adaptation☆240Updated 5 months ago
- Oriented Bounding Box (OBB) -based Instance Segmentation (MIR 2025)☆88Updated last year
- [AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story…☆134Updated 3 months ago
- ☆32Updated 7 months ago
- Jacobi Forcing: Fast and Accurate Diffusion-style Decoding☆153Updated 3 weeks ago
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆82Updated 9 months ago
- 🔥 JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization☆272Updated this week