[ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"
☆62Jul 5, 2025Updated 10 months ago
Alternatives and similar repositories for OphNet-benchmark
Users that are interested in OphNet-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Jan 12, 2024Updated 2 years ago
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding☆44Nov 30, 2024Updated last year
- ☆16Jul 5, 2021Updated 4 years ago
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures☆81Sep 14, 2025Updated 7 months ago
- TMI 2023: Less is More: Surgical Phase Recognition from Timestamp Supervision☆22Feb 9, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- IEEE TMI 2022: Exploring Segment-level Semantics for Online Phase Recognition from Surgical Videos☆15Jun 27, 2022Updated 3 years ago
- ☆30Sep 16, 2024Updated last year
- ☆18Sep 19, 2024Updated last year
- [MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"☆29Nov 25, 2024Updated last year
- ☆20Sep 19, 2025Updated 7 months ago
- Official repository of the GraSP dataset and implemention of TAPIS☆53Dec 31, 2024Updated last year
- [NeurIPS'25] EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis☆61Mar 19, 2026Updated last month
- ☆13Jun 26, 2022Updated 3 years ago
- This repository contains the implementation of the methods presented in the paper "Effective semantic segmentation in Cataract Surgery: W…☆20Jun 23, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"☆56Mar 2, 2026Updated 2 months ago
- There are compilations of surgery-related tasks, datasets, and papers.☆167Apr 3, 2026Updated last month
- Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"☆42Jun 4, 2025Updated 11 months ago
- Reading list for deep learning in Computer Vision and Medical Image Analysis☆12Nov 2, 2021Updated 4 years ago
- ☆46Feb 16, 2026Updated 2 months ago
- ☆38Apr 5, 2025Updated last year
- Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…☆18Feb 12, 2025Updated last year
- A transformer-inspired neural network for surgical action triplet recognition from laparoscopic videos.☆33Sep 17, 2025Updated 7 months ago
- Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M…☆26Feb 21, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Oct 5, 2023Updated 2 years ago
- [IEEE TPAMI 2025] This repository is the official implementation of the paper "VisionUnite: A Vision-Language Foundation Model for Ophtha…☆58Feb 2, 2026Updated 3 months ago
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"☆48Apr 19, 2024Updated 2 years ago
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Offical Implementation of "Surgical-VQA: Visual Question Answ…☆65Mar 27, 2023Updated 3 years ago
- Dataset for multi-perspective surgical tool tracking☆35Feb 21, 2026Updated 2 months ago
- Official code of the paper MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environ…☆55Aug 27, 2025Updated 8 months ago
- [MedIA'25] FLAIR: A Foundation LAnguage-Image model of the Retina for fundus image understanding.☆178Nov 27, 2025Updated 5 months ago
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆24Jan 6, 2025Updated last year
- SurgLaVi: Official repository☆32Mar 4, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [AAAI 2025] MonoBox: Tightness-free Box-supervised Polyp Segmentation using Monotonicity Constraint☆19Dec 5, 2025Updated 4 months ago
- [CVPR2024] PairAug: What Can Augmented Image-Text Pairs Do for Radiology?☆29Nov 11, 2024Updated last year
- Next-generation dermatology FM☆19Apr 13, 2026Updated 3 weeks ago
- The official code to build up dataset PMC-OA☆34Jul 16, 2024Updated last year
- This repository contains video datasets that can be used for training coarse to fine-grained (phase, step and action) temporal classifica…☆16Oct 26, 2021Updated 4 years ago
- Papers from the intersection of surgery and data science / machine learning☆15Jan 28, 2024Updated 2 years ago
- Official implementation of "Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models"☆15Mar 27, 2026Updated last month