med-air / AI-Endo
Code repository of AI-Endo
☆10 Updated last year
Alternatives and similar repositories for AI-Endo:
Users who are interested in AI-Endo are comparing it to the repositories listed below.
- Learning multi-modal representations by watching hundreds of surgical video lectures ☆59 Updated this week
- ☆18 Updated 6 months ago
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024. ☆22 Updated 3 months ago
- [MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train ☆187 Updated last year
- Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding (ICLR 2025) ☆59 Updated 2 weeks ago
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery" ☆32 Updated last year
- ☆56 Updated 10 months ago
- OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding ☆41 Updated last month
- Large-scale Self-supervised Pre-training for Endoscopy ☆32 Updated 10 months ago
- The original code for paper "Towards a Holistic Framework for Multimodal LLM in 3D Brain CT Radiology Report Generation" ☆26 Updated this week
- A repository for surgical action triplet dataset. Data are videos of laparoscopic cholecystectomy that have been annotated with <instrume… ☆52 Updated last year
- ☆26 Updated 9 months ago
- SSG-VQA is a Visual Question Answering (VQA) dataset on laparoscopic videos providing diverse, geometrically grounded, unbiased and surgi… ☆34 Updated 7 months ago
- ☆33 Updated 3 weeks ago
- Code implementation of RP3D-Diag ☆68 Updated 4 months ago
- MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions ☆12 Updated 2 years ago
- ☆20 Updated 3 months ago
- ☆15 Updated 7 months ago
- Official Code Release for "Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segment… ☆48 Updated last year
- This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment… ☆27 Updated 6 months ago
- ICCV 2023, "GraphEcho: Graph-Driven Unsupervised Domain Adaptation for Echocardiogram Video Segmentation" ☆46 Updated 10 months ago
- (TMI-2024) Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery ☆19 Updated 5 months ago
- The official implementation of "ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training" ☆38 Updated 2 weeks ago
- Official Repository for the Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment ☆44 Updated 2 months ago
- ☆34 Updated last week
- Code and models for MICCAI23 paper: "Self-Supervised Learning for Endoscopy Video Analysis". ☆16 Updated last year
- The official repository for "One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts" ☆185 Updated last week
- An official implementation for UniBrain: Universal Brain MRI Diagnosis with Hierarchical Knowledge-enhanced Pre-training ☆28 Updated last month
- ☆68 Updated 10 months ago
- [EMNLP, Findings 2024] a radiology report generation metric that leverages the natural language understanding of language models to ident… ☆45 Updated last month