lalithjets / SurgicalGPT
☆29Updated last year
Alternatives and similar repositories for SurgicalGPT
Users who are interested in SurgicalGPT are comparing it to the libraries listed below
- ☆19Updated last year
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Official Implementation of "Surgical-VQA: Visual Question Answ…☆59Updated 2 years ago
- LLaVa Version of RaDialog☆24Updated 6 months ago
- [IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge☆45Updated 6 months ago
- [MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"☆27Updated last year
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures☆76Updated 2 months ago
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆24Updated 9 months ago
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆24Updated 11 months ago
- [NeurIPS'25][OralGPT & MMOral] The official repo of OralGPT & MMOral Bench.☆49Updated this week
- A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…☆92Updated last year
- Official code for the Paper "RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance"☆106Updated 6 months ago
- [MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"☆46Updated 2 weeks ago
- Code for the paper "ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning" (ACL'23).☆55Updated last year
- Expert-level AI radiology report evaluator☆35Updated 8 months ago
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"☆44Updated last year
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆32Updated last year
- AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation☆45Updated 7 months ago
- The collection of medical VLP papers☆19Updated last year
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays☆42Updated 6 months ago
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆59Updated 5 months ago
- The official code for MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training in Radiology. We propose to leverage medical specif…☆174Updated 2 years ago
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)☆24Updated 2 years ago
- The official code to build up dataset PMC-OA☆33Updated last year
- Chest X-Ray Explainer (ChEX)☆21Updated 10 months ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆17Updated 6 months ago
- Code implementation of RP3D-Diag☆16Updated last year
- [MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.☆124Updated 3 years ago
- ☆43Updated 2 years ago
- The repo of ASGMVLP☆17Updated last year
- The code for paper: PeFoM-Med: Parameter Efficient Fine-tuning on Multi-modal Large Language Models for Medical Visual Question Answering☆56Updated 5 months ago