Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"
☆46Apr 19, 2024Updated last year
Alternatives and similar repositories for GSViT
Users that are interested in GSViT are comparing it to the libraries listed below
Sorting:
- ☆42Feb 16, 2026Updated 2 weeks ago
- Official repository of the GraSP dataset and implemention of TAPIS☆50Dec 31, 2024Updated last year
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures☆79Sep 14, 2025Updated 5 months ago
- ☆19Sep 19, 2025Updated 5 months ago
- [MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"☆50Nov 25, 2025Updated 3 months ago
- Large-scale Self-supervised Pre-training for Endoscopy☆44Jun 11, 2024Updated last year
- ☆10Oct 7, 2023Updated 2 years ago
- 【CVPR2026】Official repository for the paper "LEMON: A Large Endoscopic MONocular Dataset and Foundation Model for Perception in Surgical …☆81Updated this week
- ☆13Jun 26, 2022Updated 3 years ago
- Code and models for MICCAI23 paper: "Self-Supervised Learning for Endoscopy Video Analysis".☆21Oct 2, 2023Updated 2 years ago
- There are compilations of surgery-related tasks, datasets, and papers.☆149Nov 9, 2025Updated 3 months ago
- [Nature Biomedical Engineering 2023] Decoding surgical activity from videos with a vision transformer☆22Jun 6, 2024Updated last year
- Official code of the paper "EgoExOR: EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding" accepted at …☆25Feb 20, 2026Updated last week
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Offical Implementation of "Surgical-VQA: Visual Question Answ…☆62Mar 27, 2023Updated 2 years ago
- List of surgical tool datasets organised by task.☆171Aug 30, 2024Updated last year
- [MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"☆28Nov 25, 2024Updated last year
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆19Jun 2, 2025Updated 9 months ago
- [MICCAI 2024] Surgformer: Surgical Transformer with Hierarchical Temporal Attention for Surgical Phase Recognition☆44Aug 28, 2025Updated 6 months ago
- CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery☆15Dec 18, 2025Updated 2 months ago
- ☆51Jun 12, 2025Updated 8 months ago
- Official pytorch implementation of MuST: Multi-Scale Transformers for Surgical Phase Recognition MICCAI 2024☆15Jan 13, 2025Updated last year
- ☆29Feb 7, 2024Updated 2 years ago
- We constructed the first multi-center neurosurgical workflow imaging dataset, and developed the AI-NeuroAdvisor intelligent surgical phas…☆77Dec 15, 2025Updated 2 months ago
- MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions☆12Sep 17, 2022Updated 3 years ago
- ☆13Nov 19, 2020Updated 5 years ago
- ☆16May 25, 2018Updated 7 years ago
- Learning by Aligning Videos in Time (CVPR 2021)☆14Sep 10, 2023Updated 2 years ago
- ☆14Dec 14, 2024Updated last year
- Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"☆32Jun 4, 2025Updated 8 months ago
- Official implementation of SurgicalPart-SAM (SP-SAM)☆13Mar 26, 2024Updated last year
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆61Jul 5, 2025Updated 7 months ago
- Endora: Video Generation Models as Endoscopy Simulators (MICCAI 2024)☆149Feb 4, 2026Updated 3 weeks ago
- ☆14Nov 28, 2024Updated last year
- ORBIT-Surgical: An Open-Simulation Framework for Learning Surgical Augmented Dexterity☆160Dec 16, 2024Updated last year
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answering…☆31Jan 31, 2023Updated 3 years ago
- ☆66Feb 1, 2024Updated 2 years ago
- Adding Scene-Centric Forecasting Control to Occupancy World Model☆38Aug 24, 2025Updated 6 months ago
- [ICCV 2021] Official implementation of 'Learning Motion-Appearance Co-Attention for Zero-Shot Video Object Segmentation', in Pytorch.☆16May 12, 2023Updated 2 years ago
- [MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train☆215Oct 11, 2025Updated 4 months ago