minghu0830/OphNet-benchmark

[ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"

☆63

Alternatives and similar repositories for OphNet-benchmark

Users who are interested in OphNet-benchmark are comparing it to the repositories listed below.

minghu0830 / NurViD-benchmark
☆23 · Jan 12, 2024 · Updated 2 years ago
richard-peng-xia / HGCLIP
[COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding
☆45 · Nov 30, 2024 · Updated last year
XuMengyaAmy / CIDACaptioning
☆17 · Jul 5, 2021 · Updated 5 years ago
CAMMA-public / SurgVLP
[MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures
☆86 · Sep 14, 2025 · Updated 10 months ago
xmed-lab / TimeStamp-Surgical
TMI 2023: Less is More: Surgical Phase Recognition from Timestamp Supervision
☆22 · Feb 9, 2023 · Updated 3 years ago
isyangshu / Awesome-Surgical-Video-Understanding
There are compilations of surgery-related tasks, datasets, and papers.
☆183 · Apr 3, 2026 · Updated 3 months ago
xmed-lab / SAHC
IEEE TMI 2022: Exploring Segment-level Semantics for Online Phase Recognition from Surgical Videos
☆17 · Jun 27, 2022 · Updated 4 years ago
yhygao / Explicd
☆18 · Sep 19, 2024 · Updated last year
CAMMA-public / MultiBypass140
☆22 · Sep 19, 2025 · Updated 10 months ago
XuMengyaAmy / SwinMLP_TranCAP
☆13 · Jun 26, 2022 · Updated 4 years ago
CAMMA-public / cholectrack20
Dataset for multi-perspective surgical tool tracking
☆37 · Feb 21, 2026 · Updated 5 months ago
isyangshu / SurgVISTA
Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"
☆52 · Jun 4, 2025 · Updated last year
RViMLab / MICCAI2021_Cataract_semantic_segmentation
This repository contains the implementation of the methods presented in the paper "Effective semantic segmentation in Cataract Surgery: W…
☆20 · Jun 23, 2024 · Updated 2 years ago
Fujiry0 / EgoSurgery
[MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"
☆28 · Nov 25, 2024 · Updated last year
TimJaspers0801 / SurgeNet
[MedIA 2025] Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"
☆61 · Mar 2, 2026 · Updated 4 months ago
luiscarlosgph / list-of-surgical-tool-datasets
List of surgical tool datasets organised by task.
☆177 · Aug 30, 2024 · Updated last year
Negin-Ghamsarian / Cataract-1K
☆39 · Sep 16, 2024 · Updated last year
SamuelSchmidgall / GSViT
Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"
☆51 · Apr 19, 2024 · Updated 2 years ago
Guo-Xiaoqing / Reading-List
Reading list for deep learning in Computer Vision and Medical Image Analysis
☆12 · Nov 2, 2021 · Updated 4 years ago
BCV-Uniandes / TAPIR
☆38 · Apr 5, 2025 · Updated last year
Flaick / Surgical-Workflow-Anticipation
[MedIA'22] Anticipation for surgical workflow through instrument interaction and recognized signals
☆17 · Feb 11, 2022 · Updated 4 years ago
mendicant04 / DermoGPT
☆17 · May 14, 2026 · Updated 2 months ago
jinlab-imvr / SurgVLM
☆66 · Apr 21, 2026 · Updated 3 months ago
BCV-Uniandes / GraSP
Official repository of the GraSP dataset and implementation of TAPIS
☆56 · Dec 31, 2024 · Updated last year
GQBBBB / UCI
☆10 · Oct 5, 2023 · Updated 2 years ago
UCSB-AI / ProbMed
Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M…
☆25 · May 12, 2026 · Updated 2 months ago
CAMMA-public / rendezvous
A transformer-inspired neural network for surgical action triplet recognition from laparoscopic videos.
☆39 · Sep 17, 2025 · Updated 10 months ago
CUHK-AIM-Group / EndoBench
[NeurIPS'25] EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
☆65 · Mar 19, 2026 · Updated 4 months ago
HUANGLIZI / VisionUnite
[IEEE TPAMI 2025] This repository is the official implementation of the paper "VisionUnite: A Vision-Language Foundation Model for Ophtha…
☆58 · Jul 3, 2026 · Updated 3 weeks ago
GeWu-Lab / Diagnosing_Relearning_ECCV2024
The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024
☆29 · Jul 30, 2024 · Updated last year
lalithjets / Surgical_VQA
Surgical Visual Question Answering. A transformer-based surgical VQA model. Official Implementation of "Surgical-VQA: Visual Question Answ…
☆68 · Mar 27, 2023 · Updated 3 years ago
tobiascz / TeCNO
☆71 · Feb 1, 2024 · Updated 2 years ago
egeozsoy / ORacle
Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.
☆25 · Jan 6, 2025 · Updated last year
egeozsoy / MM-OR
Official code of the paper MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environ…
☆59 · Aug 27, 2025 · Updated 10 months ago
richard-peng-xia / LMPT
[ACLW'24] LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition
☆58 · Aug 13, 2024 · Updated last year
YtongXie / PairAug
[CVPR2024] PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
☆29 · Nov 11, 2024 · Updated last year
farrell236 / RATCHET
RAdiological Text Captioning for Human Examined Thoraxes
☆46 · Sep 3, 2023 · Updated 2 years ago
WeixiongLin / Build-PMC-OA
The official code to build the PMC-OA dataset
☆34 · Jul 16, 2024 · Updated 2 years ago
DeweiHu / AdaptDiff
Weak conditional diffusion for domain adaptation
☆12 · Nov 4, 2024 · Updated last year