lalithjets / SurgicalGPTLinks
☆28Updated last year
Alternatives and similar repositories for SurgicalGPT
Users that are interested in SurgicalGPT are comparing it to the libraries listed below
Sorting:
- ☆19Updated 11 months ago
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Offical Implementation of "Surgical-VQA: Visual Question Answ…☆56Updated 2 years ago
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆23Updated 9 months ago
- [IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge☆46Updated 4 months ago
- LLaVa Version of RaDialog☆23Updated 4 months ago
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆22Updated 7 months ago
- Official code for the Paper "RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance"☆105Updated 4 months ago
- Code for the paper "ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning" (ACL'23).☆55Updated last year
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆31Updated 11 months ago
- The official GitHub repository of the AAAI-2024 paper "Bootstrapping Large Language Models for Radiology Report Generation".☆60Updated last year
- The official code to build up dataset PMC-OA☆32Updated last year
- This repository is made for the paper: Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medica…☆47Updated last year
- ☆20Updated 9 months ago
- [ECCV'2024] HERGen: Elevating Radiology Report Generation with Longitudinal Data☆23Updated 3 months ago
- Chest X-Ray Explainer (ChEX)☆21Updated 8 months ago
- The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".☆42Updated 10 months ago
- [NeurIPS 2025][OralGPT & MMOral] Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Digital Dentistry☆32Updated this week
- Multi-Aspect Vision Language Pretraining - CVPR2024☆82Updated last year
- [MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"☆22Updated 10 months ago
- A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…☆88Updated last year
- ☆43Updated last year
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays☆38Updated 4 months ago
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures☆75Updated 3 weeks ago
- The official implementation of "ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training"☆42Updated 3 weeks ago
- Radiology Report Generation with Frozen LLMs☆95Updated last year
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature☆83Updated 6 months ago
- ☆39Updated last year
- Code implementation of RP3D-Diag☆16Updated 10 months ago
- Official repository for the paper "Rad-ReStruct: A Novel VQA Benchmark and Method for Structured Radiology Reporting" (MICCAI23)☆29Updated last year
- Official implementation of LLaVa-Rad, a small multimodal model for chest X-ray findings generation.☆43Updated 2 months ago