lalithjets / SurgicalGPTLinks
☆29Updated last year
Alternatives and similar repositories for SurgicalGPT
Users that are interested in SurgicalGPT are comparing it to the libraries listed below
Sorting:
- ☆19Updated last month
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Offical Implementation of "Surgical-VQA: Visual Question Answ…☆61Updated 2 years ago
- [IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge☆45Updated 7 months ago
- LLaVa Version of RaDialog☆25Updated 7 months ago
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures☆79Updated 4 months ago
- [MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"☆28Updated last year
- [NeurIPS'25][OralGPT & MMOral] The official repo of OralGPT & MMOral Bench.☆60Updated last week
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆24Updated last year
- [NAACL 2025] VividMed: Vision Language Model with Versatile Visual Grounding for Medicine☆28Updated 10 months ago
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆25Updated 10 months ago
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆34Updated last year
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"☆45Updated last year
- This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment…☆34Updated 4 months ago
- Expert-level AI radiology report evaluator☆35Updated 9 months ago
- The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".☆46Updated 2 weeks ago
- Multi-Aspect Vision Language Pretraining - CVPR2024☆87Updated last year
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆85Updated 7 months ago
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆27Updated 2 months ago
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature☆87Updated 9 months ago
- [MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"☆48Updated last month
- ☆20Updated last year
- Code for the paper "ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning" (ACL'23).☆55Updated last year
- A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…☆94Updated last year
- Official code for the Paper "RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance"☆110Updated 7 months ago
- The repo of ASGMVLP☆17Updated this week
- Chest X-Ray Explainer (ChEX)☆22Updated 11 months ago
- The official code to build up dataset PMC-OA☆34Updated last year
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆59Updated 6 months ago
- The official code for MedAgent_Pro☆85Updated 4 months ago
- Official repository for the paper "Rad-ReStruct: A Novel VQA Benchmark and Method for Structured Radiology Reporting" (MICCAI23)☆32Updated 2 years ago