isyangshu / SurgVISTA
Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"
☆25Updated 6 months ago
Alternatives and similar repositories for SurgVISTA
Users who are interested in SurgVISTA are comparing it to the repositories listed below.
- Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding (ICLR 2025)☆111Updated 8 months ago
- Code implementation of RP3D-Diag☆75Updated 3 months ago
- [ICCV 2025] AbdomenAtlas 3.0 (9,262 CT volumes + medical reports). These “superhuman” reports are more accurate, detailed, standardized, …☆174Updated last week
- [MICCAI 2024] Surgformer: Surgical Transformer with Hierarchical Temporal Attention for Surgical Phase Recognition☆40Updated 3 months ago
- ☆38Updated last month
- Compilations of surgery-related tasks, datasets, and papers.☆126Updated last month
- ☆57Updated 2 months ago
- Official repository of the GraSP dataset and implementation of TAPIS☆45Updated 11 months ago
- The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".☆43Updated last year
- (AAAI-2025 oral) LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts☆52Updated 5 months ago
- The official repository for "One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts"☆10Updated last year
- (TMI-2024) Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery☆25Updated last year
- ICCV 2023, "GraphEcho: Graph-Driven Unsupervised Domain Adaptation for Echocardiogram Video Segmentation"☆54Updated last year
- [MICCAI 2025 Best Paper Award Runner-up] Learning Segmentation from Radiology Reports☆87Updated 2 weeks ago
- An official implementation of UniBrain: Universal Brain MRI Diagnosis with Hierarchical Knowledge-enhanced Pre-training☆35Updated 9 months ago
- CVPR 2024 (Highlight)☆142Updated last year
- ☆44Updated 5 months ago
- The official repository of the paper 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆32Updated last year
- ☆22Updated 8 months ago
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures☆76Updated 2 months ago
- ☆43Updated 9 months ago
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆59Updated 5 months ago
- MICCAI 2024: Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images☆25Updated 8 months ago
- ☆34Updated 7 months ago
- Official repository for the paper "Prototype Representation Joint Learning from Medical Images and Reports, ICCV 2023".☆75Updated 2 years ago
- The official codes for "M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging"☆33Updated 4 months ago
- The official repository to build SAT-DS, a medical data collection of over 72 public segmentation datasets, contains over 22K 3D images, …☆132Updated last week
- [NeurIPS 2023] Text Promptable Surgical Instrument Segmentation with Vision-Language Models☆43Updated last year
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆83Updated 6 months ago
- MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions☆12Updated 3 years ago