egeozsoy/MM-OR

Official code of the paper MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments accepted at CVPR 2025. This repo includes both the dataset and our code.

☆59

Alternatives and similar repositories for MM-OR

Users that are interested in MM-OR are comparing it to the libraries listed below.

ardamamur / EgoExOR
Official code of the paper "EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding" accepted at …
☆28 · May 6, 2026 · Updated 2 months ago
egeozsoy / 4D-OR
Official code of the paper 4D-OR: Semantic Scene Graphs for OR Domain Modeling accepted at MICCAI 2022. This repo includes both the datas…
☆63 · Mar 29, 2025 · Updated last year
egeozsoy / ORacle
Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.
☆25 · Jan 6, 2025 · Updated last year
CAMMA-public / SSG-VQA
[IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge
☆52 · May 23, 2025 · Updated last year
CAMMA-public / SurgVLP
[MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures
☆86 · Sep 14, 2025 · Updated 10 months ago
Fujiry0 / EgoSurgery
[MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"
☆28 · Nov 25, 2024 · Updated last year
isyangshu / Awesome-Surgical-Video-Understanding
Compilations of surgery-related tasks, datasets, and papers.
☆183 · Apr 3, 2026 · Updated 3 months ago
BCV-Uniandes / ISINet
PyTorch implementation of the MICCAI 2020 paper ISINet: An Instance-Based Approach for Surgical Instrument Segmentation.
☆27 · Oct 12, 2021 · Updated 4 years ago
isyangshu / Surgformer
[MICCAI 2024] Surgformer: Surgical Transformer with Hierarchical Temporal Attention for Surgical Phase Recognition
☆51 · Aug 28, 2025 · Updated 10 months ago
SamuelSchmidgall / GSViT
Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"
☆51 · Apr 19, 2024 · Updated 2 years ago
mobarakol / PitVQA
☆21 · Dec 19, 2025 · Updated 7 months ago
tobiascz / TeCNO
☆71 · Feb 1, 2024 · Updated 2 years ago
franciszchen / SurgBox
☆16 · Dec 14, 2024 · Updated last year
ccccchenllll / SGT_master
☆16 · Nov 28, 2024 · Updated last year
UCSB-AI / ProbMed
Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M…
☆25 · May 12, 2026 · Updated 2 months ago
minghu0830 / OphNet-benchmark
[ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"
☆63 · Jul 5, 2025 · Updated last year
isyangshu / SurgVISTA
Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"
☆52 · Jun 4, 2025 · Updated last year
pqpq17 / Awesome-LLM-Reasoning-on-Medicine
The Official Repo for Paper: Aligning Clinical Needs and AI Capabilities: A Survey on LLMs for Medical Reasoning
☆24 · Apr 7, 2026 · Updated 3 months ago
XuMengyaAmy / ReportDALS
☆16 · Nov 19, 2020 · Updated 5 years ago
jinlab-imvr / SurgVLM
☆66 · Apr 21, 2026 · Updated 3 months ago
aperezr20 / SurgLaVi
SurgLaVi: Official repository
☆40 · Jul 8, 2026 · Updated 2 weeks ago
open-h / open-h-embodiment
Open-H-Embodiment is a community‑driven dataset initiative building the open, shared foundation needed to train and evaluate a generalist…
☆131 · Jun 12, 2026 · Updated last month
jinlab-imvr / ReSurgSAM2
[MICCAI 2025] Official code implementation for paper: ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tra…
☆43 · Nov 4, 2025 · Updated 8 months ago
longbai1006 / Surgical-VQLA
Official implementation of "Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Ro…
☆27 · Jul 7, 2024 · Updated 2 years ago
gkw0010 / EndoChat
☆51 · Feb 16, 2026 · Updated 5 months ago
TimJaspers0801 / SurgeNet
[MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"
☆61 · Mar 2, 2026 · Updated 4 months ago
zixinyang9109 / LiverMatch
Learning Feature Descriptors for Pre- and Intra-operative Point Cloud Matching for Laparoscopic Liver Registration
☆26 · Apr 8, 2026 · Updated 3 months ago
RoyHirsch / endossl
Code and models for MICCAI23 paper: "Self-Supervised Learning for Endoscopy Video Analysis".
☆25 · Oct 2, 2023 · Updated 2 years ago
XuMengyaAmy / SwinMLP_TranCAP
☆13 · Jun 26, 2022 · Updated 4 years ago
Project-MONAI / VLM-Surgical-Agent-Framework
Multi-modal agentic framework for surgical procedures
☆41 · Mar 14, 2026 · Updated 4 months ago
kathrin229 / 3d-transformer-med-classification
Repository for a Master's thesis project investigating classification of 3D chest CT scans using a Vision Transformer.
☆15 · Aug 29, 2023 · Updated 2 years ago
CAMMA-public / MVOR
Multi-View Operating Room (MVOR) dataset consists of synchronized multi-view frames recorded by three RGB-D cameras in a hybrid OR during…
☆75 · May 23, 2025 · Updated last year
Ruggero1912 / Patch-ioner
[CVPR 2026] Official Repository of the Paper "One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework"
☆15 · Jun 4, 2026 · Updated last month
visurg-ai / LEMON
[CVPR 2026] Official repository for the paper "LEMON: A Large Endoscopic MONocular Dataset and Foundation Model for Perception in Surgica…
☆97 · Jul 4, 2026 · Updated 2 weeks ago
chenxy99 / GazeXplain
[ECCV 2024 Oral] GazeXplain - Official PyTorch Implementation
☆17 · Feb 24, 2025 · Updated last year
tangzhengxu2001 / m4oe
☆16 · Apr 3, 2025 · Updated last year
CUHK-AIM-Group / EndoBench
[NeurIPS'25] EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
☆65 · Mar 19, 2026 · Updated 4 months ago
mobarakol / SurgicalAICopilot
☆18 · Jan 20, 2026 · Updated 6 months ago
XYPB / CLEFT
Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…
☆18 · Feb 12, 2025 · Updated last year