longbai1006 / Surgical-VQLAView external linksLinks
Official implementation of "Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery", ICRA 2023.
☆24Jul 7, 2024Updated last year
Alternatives and similar repositories for Surgical-VQLA
Users that are interested in Surgical-VQLA are comparing it to the libraries listed below
Sorting:
- Official implementation of “CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surg…☆17Jul 7, 2024Updated last year
- Globally reasoned multi-task model for surgical scene understanding. A multi-task model for segmentation and scene graph. Offical Impleme…☆14May 5, 2022Updated 3 years ago
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Offical Implementation of "Surgical-VQA: Visual Question Answ…☆62Mar 27, 2023Updated 2 years ago
- ☆19Dec 19, 2025Updated last month
- ☆13Nov 19, 2020Updated 5 years ago
- ☆16Jul 5, 2021Updated 4 years ago
- ☆15Jul 4, 2023Updated 2 years ago
- [MICCAI'22] AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy☆19Mar 21, 2025Updated 10 months ago
- Official implementation of “LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusio…☆20Jul 7, 2024Updated last year
- Official implementation of "EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy", MICCAI 2…☆11Jan 29, 2026Updated 2 weeks ago
- We constructed the first multi-center neurosurgical workflow imaging dataset, and developed the AI-NeuroAdvisor intelligent surgical phas…☆77Dec 15, 2025Updated 2 months ago
- Pytorch implementation for MICCAI2022 - Rethinking Surgical Instrument Segmentation: A Background Image Can Be All You Need☆16Aug 4, 2023Updated 2 years ago
- ☆15May 31, 2024Updated last year
- ☆13Jun 26, 2022Updated 3 years ago
- ☆37Apr 5, 2025Updated 10 months ago
- [IEEE TMI'22] VQAMix: Conditional Triplet Mixup for Medical Visual Question Answering☆16Oct 9, 2022Updated 3 years ago
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"☆46Apr 19, 2024Updated last year
- Implementation of ''VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation''☆15Sep 16, 2025Updated 5 months ago
- S2ME: Spatial-Spectral Mutual Teaching and Ensemble Learning for Scribble-supervised Polyp Segmentation (MICCAI 2023)☆19Dec 1, 2023Updated 2 years ago
- Official code of the paper MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environ…☆52Aug 27, 2025Updated 5 months ago
- VQA-Med 2021☆22Jul 11, 2022Updated 3 years ago
- Open-H-Embodiment is a community‑driven dataset initiative building the open, shared foundation needed to train and evaluate a generalist…☆66Dec 20, 2025Updated last month
- [MICCAI 2025] Official code implementation for paper: ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tra…☆36Nov 4, 2025Updated 3 months ago
- Pytorch implementation of the MICCAI 2020 paper ISINet: An Instance-Based Approach for Surgical Instrument Segmentation.☆24Oct 12, 2021Updated 4 years ago
- Official Repository for the Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment☆58Sep 17, 2025Updated 4 months ago
- [NeurIPS 2024 Workshop AIM-FM] Official code implementation for paper: Surgical SAM 2☆69Apr 15, 2025Updated 10 months ago
- Repository for the paper: Open-Ended Medical Visual Question Answering Through Prefix Tuning of Language Models (https://arxiv.org/abs/23…☆19Sep 2, 2023Updated 2 years ago
- [TMI'22]Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation☆23Dec 20, 2022Updated 3 years ago
- ☆42Feb 8, 2026Updated last week
- ☆29Feb 7, 2024Updated 2 years ago
- A repository for surgical action triplet dataset. Data are videos of laparoscopic cholecystectomy that have been annotated with <instrume…☆74Sep 17, 2025Updated 4 months ago
- Tackling View-Dependent Semantics in 3D Language Gaussian Splatting (ICML 2025)☆60Jun 3, 2025Updated 8 months ago
- code for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering☆30May 30, 2025Updated 8 months ago
- [ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection☆12Feb 6, 2024Updated 2 years ago
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 7 months ago
- UR Robotic Arm with Robotiq 2-Finger Gripper for ROS2☆22Jul 10, 2025Updated 7 months ago
- ROS2 catestian_impedance_controller from PdZ☆11Oct 22, 2025Updated 3 months ago
- Segment-Anything-2 (SAM 2) fine tune with COCO data☆14Aug 20, 2024Updated last year
- [ICRA 2025] A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping☆11Feb 7, 2025Updated last year