Surgical Visual Question Answering. A transformer-based surgical VQA model. Official implementation of "Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformers", MICCAI 2022.
☆62Mar 27, 2023Updated 2 years ago
Alternatives and similar repositories for Surgical_VQA
Users who are interested in Surgical_VQA are comparing it to the libraries listed below:
- ☆14Nov 28, 2024Updated last year
- Official implementation of "Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Ro…☆24Jul 7, 2024Updated last year
- ☆13Nov 19, 2020Updated 5 years ago
- ☆29Feb 7, 2024Updated 2 years ago
- ☆13Jun 26, 2022Updated 3 years ago
- ☆16Jul 5, 2021Updated 4 years ago
- ☆19Dec 19, 2025Updated 2 months ago
- Globally reasoned multi-task model for surgical scene understanding. A multi-task model for segmentation and scene graph. Official Impleme…☆14May 5, 2022Updated 3 years ago
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆24Jan 6, 2025Updated last year
- ☆17May 19, 2023Updated 2 years ago
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)☆25Mar 28, 2023Updated 2 years ago
- Official repository of the GraSP dataset and implementation of TAPIS☆50Dec 31, 2024Updated last year
- ☆16Sep 17, 2025Updated 5 months ago
- VQA-Med 2021☆22Jul 11, 2022Updated 3 years ago
- ☆30Sep 16, 2024Updated last year
- Localized questions for VQA☆11May 6, 2025Updated 10 months ago
- ☆12Mar 18, 2024Updated last year
- [MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"☆52Updated this week
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"☆46Apr 19, 2024Updated last year
- [IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge☆47May 23, 2025Updated 9 months ago
- MMBERT: Multimodal BERT Pretraining for Improved Medical VQA☆38Mar 22, 2021Updated 4 years ago
- CholecTriplet 2022 challenge on surgical action triplet detection☆12Sep 17, 2025Updated 5 months ago
- ☆19Sep 19, 2025Updated 5 months ago
- A repository for surgical action triplet dataset. Data are videos of laparoscopic cholecystectomy that have been annotated with <instrume…☆75Sep 17, 2025Updated 5 months ago
- Official implementation of “CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surg…☆17Jul 7, 2024Updated last year
- Visual Question Answering in the Medical Domain VQA-Med 2019☆94Jan 12, 2024Updated 2 years ago
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures☆79Sep 14, 2025Updated 5 months ago
- AIOZ AI - Overcoming Data Limitation in Medical Visual Question Answering (MICCAI 2019)☆69Oct 3, 2023Updated 2 years ago
- SurgLaVi: Official repository☆27Updated this week
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆61Jul 5, 2025Updated 8 months ago
- [ICMR'21, Best Poster Paper Award] Medical Visual Question Answering with Multi-task Pre-training and Cross-modal Self-attention☆35Dec 15, 2022Updated 3 years ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆19Jun 2, 2025Updated 9 months ago
- (TMI-2024) Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery☆25Nov 13, 2024Updated last year
- Large-scale Self-supervised Pre-training for Endoscopy☆44Jun 11, 2024Updated last year
- CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery☆15Dec 18, 2025Updated 2 months ago
- ☆10Oct 7, 2023Updated 2 years ago
- Implementation of Mutan+ArticleNet on OKVQA☆10Jan 11, 2021Updated 5 years ago
- ☆14Jul 8, 2024Updated last year
- Laparoscopic video dataset for surgical action triplet recognition☆43Sep 17, 2025Updated 5 months ago