CAMMA-public/SSG-VQA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CAMMA-public/SSG-VQA)

CAMMA-public / SSG-VQA

[IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge

☆52

Alternatives and similar repositories for SSG-VQA

Users that are interested in SSG-VQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CAMMA-public / SurgVLP
View on GitHub
[MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures
☆86Sep 14, 2025Updated 10 months ago
BCV-Uniandes / GraSP
View on GitHub
Official repository of the GraSP dataset and implemention of TAPIS
☆56Dec 31, 2024Updated last year
CAMMA-public / MultiBypass140
View on GitHub
☆22Sep 19, 2025Updated 10 months ago
isyangshu / Awesome-Surgical-Video-Understanding
View on GitHub
There are compilations of surgery-related tasks, datasets, and papers.
☆184Apr 3, 2026Updated 3 months ago
Fujiry0 / EgoSurgery
View on GitHub
[MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"
☆28Nov 25, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
BCV-Uniandes / TAPIR
View on GitHub
☆38Apr 5, 2025Updated last year
CAMMA-public / ivtmetrics
View on GitHub
A Python evaluation metrics package for surgical action triplet recognition
☆18Dec 10, 2024Updated last year
TimJaspers0801 / SurgeNet
View on GitHub
[MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"
☆61Mar 2, 2026Updated 4 months ago
CUHK-AIM-Group / MCPL
View on GitHub
MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)
☆13Apr 17, 2024Updated 2 years ago
jinlab-imvr / SurgVLM
View on GitHub
☆66Apr 21, 2026Updated 3 months ago
lalithjets / SurgicalGPT
View on GitHub
☆28Feb 7, 2024Updated 2 years ago
XuMengyaAmy / ReportDALS
View on GitHub
☆16Nov 19, 2020Updated 5 years ago
isyangshu / Surgformer
View on GitHub
[MICCAI 2024] Surgformer: Surgical Transformer with Hierarchical Temporal Attention for Surgical Phase Recognition
☆51Aug 28, 2025Updated 10 months ago
SamuelSchmidgall / GSViT
View on GitHub
Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"
☆51Apr 19, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
egeozsoy / MM-OR
View on GitHub
Official code of the paper MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environ…
☆59Aug 27, 2025Updated 10 months ago
wjhou / Recap
View on GitHub
[EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning
☆28Jun 12, 2025Updated last year
CAMMA-public / tripnet
View on GitHub
☆18Sep 17, 2025Updated 10 months ago
lalithjets / Surgical_VQA
View on GitHub
Surgical Visual Question Answering. A transformer-based surgical VQA model. Offical Implementation of "Surgical-VQA: Visual Question Answ…
☆68Mar 27, 2023Updated 3 years ago
ardamamur / EgoExOR
View on GitHub
Official code of the paper "EgoExOR: EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding" accepted at …
☆28May 6, 2026Updated 2 months ago
CAMMA-public / cholectrack20
View on GitHub
Dataset for multi-perspective surgical tool tracking
☆37Feb 21, 2026Updated 5 months ago
UARK-AICV / FG-CXR
View on GitHub
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆12Jul 28, 2025Updated 11 months ago
egeozsoy / ORacle
View on GitHub
Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.
☆25Jan 6, 2025Updated last year
anitarau / SurgBenchKit
View on GitHub
Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"
☆21Jun 2, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
XuMengyaAmy / SwinMLP_TranCAP
View on GitHub
☆13Jun 26, 2022Updated 4 years ago
mobarakol / PitVQA
View on GitHub
☆21Dec 19, 2025Updated 7 months ago
xmed-lab / DistillingSelf
View on GitHub
MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions
☆13Sep 17, 2022Updated 3 years ago
gkw0010 / EndoChat
View on GitHub
☆51Feb 16, 2026Updated 5 months ago
CAMMA-public / cholect50
View on GitHub
A repository for surgical action triplet dataset. Data are videos of laparoscopic cholecystectomy that have been annotated with <instrume…
☆85Sep 17, 2025Updated 10 months ago
ccccchenllll / SGT_master
View on GitHub
☆16Nov 28, 2024Updated last year
X-iZhang / Libra
View on GitHub
[ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…
☆30Mar 18, 2026Updated 4 months ago
cwangrun / CheXficient
View on GitHub
CheXficient
☆15Jun 28, 2026Updated 3 weeks ago
yeerwen / Awesome-Medical-Efficient-Fine-Tuning
View on GitHub
☆36Mar 25, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
visurg-ai / LEMON
View on GitHub
[CVPR 2026] Official repository for the paper "LEMON: A Large Endoscopic MONocular Dataset and Foundation Model for Perception in Surgica…
☆99Jul 4, 2026Updated 3 weeks ago
CAMMA-public / Endoscapes
View on GitHub
Official Repository for the Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment
☆63Sep 17, 2025Updated 10 months ago
yuan-12138 / VesNet-RL
View on GitHub
VesNet-RL: Simulation-based ReinforcementLearning for Real-World US Probe Navigation
☆14Sep 27, 2023Updated 2 years ago
wjhou / Radar
View on GitHub
[ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection
☆34Jul 23, 2025Updated last year
ShawnHuang497 / BiRD
View on GitHub
The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'
☆34Nov 5, 2024Updated last year
Negin-Ghamsarian / Cataract-1K
View on GitHub
☆39Sep 16, 2024Updated last year
wjhou / ICon
View on GitHub
[EMNLP 2024 Findings] ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation
☆19Dec 11, 2024Updated last year