isyangshu / Awesome-Surgical-Video-UnderstandingLinks
There are compilations of surgery-related tasks, datasets, and papers.
☆126Updated last month
Alternatives and similar repositories for Awesome-Surgical-Video-Understanding
Users that are interested in Awesome-Surgical-Video-Understanding are comparing it to the libraries listed below
Sorting:
- [MICCAI 2024] Surgformer: Surgical Transformer with Hierarchical Temporal Attention for Surgical Phase Recognition☆40Updated 3 months ago
- Official repository of the GraSP dataset and implemention of TAPIS☆45Updated 11 months ago
- [npj Digital Medicine] The official repository for "Large-Vocabulary Segmentation for Medical Images with Text Prompts"☆260Updated 3 weeks ago
- The official repository to build SAT-DS, a medical data collection of over 72 public segmentation datasets, contains over 22K 3D images, …☆132Updated last week
- Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"☆25Updated 6 months ago
- ☆38Updated 9 months ago
- Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding (ICLR 2025)☆111Updated 8 months ago
- Code implementation of RP3D-Diag☆75Updated 3 months ago
- Official implementation of MedCLIP-SAM (MICCAI 2024)☆126Updated 4 months ago
- CVPR 2024 (Highlight)☆142Updated last year
- MICCAI 2024 & CT2Rep: Automated Radiology Report Generation for 3D Medical Imaging☆113Updated last year
- paper list, dataset, and tools for radiology report generation☆310Updated this week
- One-Prompt to Segment All Medical Images [CVPR 2024]☆140Updated last year
- Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development☆257Updated 2 months ago
- [ICCV 2025] AbdomenAtlas 3.0 (9,262 CT volumes + medical reports). These “superhuman” reports are more accurate, detailed, standardized, …☆176Updated last week
- ☆98Updated last month
- [NeurIPS'22] Multi-Granularity Cross-modal Alignment for Generalized Medical Visual Representation Learning☆174Updated last year
- ☆38Updated last month
- PyTorch implementation for MA-SAM☆173Updated 4 months ago
- ☆57Updated 2 months ago
- RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic Reports☆61Updated last year
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures☆76Updated 2 months ago
- ☆152Updated last year
- M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models☆399Updated 7 months ago
- Official repository of paper titled "UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalitie…☆146Updated 7 months ago
- MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions☆12Updated 3 years ago
- A repository for surgical action triplet dataset. Data are videos of laparoscopic cholecystectomy that have been annotated with <instrume…☆69Updated 2 months ago
- [MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train☆208Updated 2 months ago
- ☆189Updated 2 months ago
- MICCAI 2023 Paper (Early Acceptance)☆189Updated 2 years ago