Surgical Visual Question Answering. A transformer-based surgical VQA model. Official implementation of "Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformers", MICCAI 2022.
☆64 · Mar 27, 2023 · Updated 3 years ago
Alternatives and similar repositories for Surgical_VQA
Users that are interested in Surgical_VQA are comparing it to the libraries listed below.
- Official implementation of "Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Ro… ☆25 · Jul 7, 2024 · Updated last year
- ☆28 · Feb 7, 2024 · Updated 2 years ago
- ☆15 · Nov 19, 2020 · Updated 5 years ago
- ☆15 · Nov 28, 2024 · Updated last year
- ☆16 · Jul 5, 2021 · Updated 4 years ago
- Globally reasoned multi-task model for surgical scene understanding. A multi-task model for segmentation and scene graph. Official Impleme… ☆14 · May 5, 2022 · Updated 3 years ago
- VQA-Med 2021 ☆22 · Jul 11, 2022 · Updated 3 years ago
- ☆38 · Apr 5, 2025 · Updated last year
- ☆19 · Dec 19, 2025 · Updated 3 months ago
- ☆17 · May 19, 2023 · Updated 2 years ago
- ☆15 · Jul 4, 2023 · Updated 2 years ago
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022) ☆26 · Mar 28, 2023 · Updated 3 years ago
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024. ☆24 · Jan 6, 2025 · Updated last year
- CholecTriplet 2022 challenge on surgical action triplet detection ☆12 · Sep 17, 2025 · Updated 7 months ago
- Official repository of the GraSP dataset and implementation of TAPIS ☆52 · Dec 31, 2024 · Updated last year
- MMBERT: Multimodal BERT Pretraining for Improved Medical VQA ☆39 · Mar 22, 2021 · Updated 5 years ago
- ☆17 · Sep 17, 2025 · Updated 7 months ago
- Official implementation of “CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surg… ☆17 · Jul 7, 2024 · Updated last year
- ☆20 · Sep 19, 2025 · Updated 6 months ago
- Localized questions for VQA ☆11 · May 6, 2025 · Updated 11 months ago
- Visual Question Answering in the Medical Domain VQA-Med 2019 ☆94 · Jan 12, 2024 · Updated 2 years ago
- SurgLaVi: Official repository ☆30 · Mar 4, 2026 · Updated last month
- This repository is made for the paper: Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medica… ☆47 · Jul 10, 2024 · Updated last year
- ☆30 · Sep 16, 2024 · Updated last year
- Pytorch implementation for MICCAI2022 - Rethinking Surgical Instrument Segmentation: A Background Image Can Be All You Need ☆16 · Aug 4, 2023 · Updated 2 years ago
- (TMI-2024) Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery ☆26 · Nov 13, 2024 · Updated last year
- A repository for surgical action triplet dataset. Data are videos of laparoscopic cholecystectomy that have been annotated with <instrume… ☆81 · Sep 17, 2025 · Updated 7 months ago
- [IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge ☆47 · May 23, 2025 · Updated 10 months ago
- ☆16 · May 31, 2024 · Updated last year
- [IEEE TMI'22] VQAMix: Conditional Triplet Mixup for Medical Visual Question Answering ☆16 · Oct 9, 2022 · Updated 3 years ago
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding" ☆62 · Jul 5, 2025 · Updated 9 months ago
- Indexity is a web-based tool designed for medical video annotation in surgical data science projects. ☆11 · Jun 27, 2023 · Updated 2 years ago
- Implementation of Mutan+ArticleNet on OKVQA ☆10 · Jan 11, 2021 · Updated 5 years ago
- ☆11 · Sep 17, 2025 · Updated 7 months ago
- Large-scale Self-supervised Pre-training for Endoscopy ☆49 · Jun 11, 2024 · Updated last year
- [EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning ☆28 · Jun 12, 2025 · Updated 10 months ago
- ☆71 · Feb 3, 2025 · Updated last year
- Laparoscopic video dataset for surgical action triplet recognition ☆43 · Sep 17, 2025 · Updated 7 months ago
- There are compilations of surgery-related tasks, datasets, and papers. ☆164 · Apr 3, 2026 · Updated 2 weeks ago