☆16Jul 5, 2021Updated 4 years ago
Alternatives and similar repositories for CIDACaptioning
Users that are interested in CIDACaptioning are comparing it to the libraries listed below
Sorting:
- ☆13Jun 26, 2022Updated 3 years ago
- ☆10Oct 7, 2023Updated 2 years ago
- ☆13Nov 19, 2020Updated 5 years ago
- ☆14Nov 28, 2024Updated last year
- ☆19Dec 19, 2025Updated 2 months ago
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Offical Implementation of "Surgical-VQA: Visual Question Answ…☆62Mar 27, 2023Updated 2 years ago
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆24Jan 6, 2025Updated last year
- CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery☆15Dec 18, 2025Updated 2 months ago
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures☆79Sep 14, 2025Updated 5 months ago
- rendezvous-in-time☆12Sep 17, 2025Updated 5 months ago
- ☆18Nov 11, 2022Updated 3 years ago
- ☆15May 31, 2024Updated last year
- Pytorch implementation for MICCAI2022 - Rethinking Surgical Instrument Segmentation: A Background Image Can Be All You Need☆16Aug 4, 2023Updated 2 years ago
- Official implementation of "Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Ro…☆24Jul 7, 2024Updated last year
- IEEE TMI 2022: Exploring Segment-level Semantics for Online Phase Recognition from Surgical Videos☆15Jun 27, 2022Updated 3 years ago
- CholecTriplet 2022 challenge on surgical action triplet detection☆12Sep 17, 2025Updated 5 months ago
- A repository for surgical action triplet dataset. Data are videos of laparoscopic cholecystectomy that have been annotated with <instrume…☆75Sep 17, 2025Updated 5 months ago
- Official repository for "Dissecting Self-Supervised Learning Methods for Surgical Computer Vision"☆43May 23, 2025Updated 9 months ago
- Official repository of the GraSP dataset and implemention of TAPIS☆50Dec 31, 2024Updated last year
- ☆15Jul 4, 2023Updated 2 years ago
- [MedIA'22] Anticipation for surgical workflow through instrument interaction and recognized signals☆17Feb 11, 2022Updated 4 years ago
- [MICCAI'21] Personalized Retrogress-Resilient Framework for Real-World Medical Federated Learning☆17Mar 23, 2022Updated 3 years ago
- [MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"☆28Nov 25, 2024Updated last year
- TMI 2023: Less is More: Surgical Phase Recognition from Timestamp Supervision☆21Feb 9, 2023Updated 3 years ago
- Official Repository for the Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment☆59Sep 17, 2025Updated 5 months ago
- List of surgical tool datasets organised by task.☆171Aug 30, 2024Updated last year
- ☆51Jun 12, 2025Updated 8 months ago
- ☆29Feb 7, 2024Updated 2 years ago
- A transformer-inspired neural network for surgical action triplet recognition from laparoscopic videos.☆32Sep 17, 2025Updated 5 months ago
- ☆69Feb 3, 2025Updated last year
- Endora: Video Generation Models as Endoscopy Simulators (MICCAI 2024)☆149Feb 4, 2026Updated last month
- Segment-Anything-2 (SAM 2) fine tune with COCO data☆14Aug 20, 2024Updated last year
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"☆46Apr 19, 2024Updated last year
- ☆13Jul 20, 2023Updated 2 years ago
- Hierarchical Vision Transformers for Disease Progression Detection in Chest X-Ray Images☆11Jan 11, 2024Updated 2 years ago
- ☆12May 22, 2022Updated 3 years ago
- Code for Paper Predicting Osteoarthritis Progression via Unsupervised Adversarial Representation Learning☆13Aug 11, 2022Updated 3 years ago
- A conversational LoRA for OPT 2.7b☆10Apr 28, 2023Updated 2 years ago
- 重构nerf代码,更加容易读懂☆13Mar 26, 2023Updated 2 years ago