aperezr20 / SurgLaVi
SurgLaVi: Large-Scale Hierarchical Datasets for Surgical Vision–Language Representation Learning
☆23Feb 2, 2026Updated 2 weeks ago
Alternatives and similar repositories for SurgLaVi
Users who are interested in SurgLaVi are comparing it to the repositories listed below
- Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"☆30Jun 4, 2025Updated 8 months ago
- ☆18Sep 19, 2025Updated 4 months ago
- Official repository of the GraSP dataset and implementation of TAPIS☆50Dec 31, 2024Updated last year
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures☆79Sep 14, 2025Updated 5 months ago
- Endora: Video Generation Models as Endoscopy Simulators (MICCAI 2024)☆149Feb 4, 2026Updated last week
- Large-scale semi-supervised framework with 1B+ labeled masks from 48K+ datasets with test-time adaptation to new domains (ICCV25).☆43Dec 28, 2025Updated last month
- Hung-yi Lee machine learning course notes☆10Jul 3, 2022Updated 3 years ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 2 months ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆19Jun 2, 2025Updated 8 months ago
- Backend for a Tik-Tok Lite version☆11Sep 18, 2023Updated 2 years ago
- Code for "A Temporally-Aware Interpolation Network for Video Frame Inpainting"☆10Jul 22, 2023Updated 2 years ago
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring☆24Aug 8, 2025Updated 6 months ago
- Reimplementation of the paper "Image Super-Resolution by Neural Texture Transfer" (CVPR 2019) in PyTorch.☆12Oct 22, 2019Updated 6 years ago
- Papers from the intersection of surgery and data science / machine learning☆15Jan 28, 2024Updated 2 years ago
- ☆10Aug 1, 2021Updated 4 years ago
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos☆12Sep 24, 2024Updated last year
- ☆13Jan 25, 2024Updated 2 years ago
- ☆14Nov 28, 2021Updated 4 years ago
- [MICCAI 2022] Toward Clinically Assisted Colorectal Polyp Recognition via Structured Cross-modal Representation Consistency☆12Nov 8, 2024Updated last year
- Algorithm submission template for the TopCoW 2024 challenge on grand-challenge☆11Feb 9, 2025Updated last year
- Learning by Aligning Videos in Time (CVPR 2021)☆14Sep 10, 2023Updated 2 years ago
- ☆13Nov 19, 2020Updated 5 years ago
- [ECCV 2024] Code for "Unleashing the Power of Prompt-driven Nucleus Instance Segmentation"☆57Jan 9, 2025Updated last year
- [EMNLP'22] Weakly-Supervised Temporal Article Grounding☆14Nov 25, 2023Updated 2 years ago
- A simple and effective feature extractor for untrimmed videos☆13Sep 1, 2022Updated 3 years ago
- Official repository for "Construction of Hierarchical Neural Architecture Search Spaces based on Context-free Grammars" (NeurIPS 2023)☆17Oct 26, 2023Updated 2 years ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆27Jan 10, 2026Updated last month
- Code and data for our paper, "FluoroSAM: A Language-aligned Foundation Model for X-ray Image Segmentation."☆17Sep 19, 2025Updated 4 months ago
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆60Jul 5, 2025Updated 7 months ago
- This is an official implementation for "Bidirectional Semi-supervised Dual-branch CNN for Robust 3D Reconstruction of Stereo Endoscopic I…☆13Apr 14, 2023Updated 2 years ago
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Official Implementation of "Surgical-VQA: Visual Question Answ…☆62Mar 27, 2023Updated 2 years ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- ☆13Jun 26, 2022Updated 3 years ago
- ☆12Apr 6, 2023Updated 2 years ago
- [IJCAI 2025] In-Context Meta LoRA Generation☆30Jul 29, 2025Updated 6 months ago
- Official repository for FactMM-RAG: Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation [NAACL …☆23Jul 12, 2025Updated 7 months ago
- Code repository of AI-Endo☆16Jan 16, 2024Updated 2 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Nov 28, 2016Updated 9 years ago
- ☆14Nov 28, 2024Updated last year