Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"
β19Jun 2, 2025Updated 9 months ago
Alternatives and similar repositories for SurgBenchKit
Users that are interested in SurgBenchKit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2025] MicroVQA eval and π€RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"β¦β34Nov 25, 2025Updated 3 months ago
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literatureβ95Mar 22, 2025Updated last year
- β41Sep 9, 2025Updated 6 months ago
- [ICLR 2025] Video Action Differencingβ52Jul 3, 2025Updated 8 months ago
- β20Apr 8, 2025Updated 11 months ago
- Histopathology Feature Extractors (2024)β14Jun 14, 2024Updated last year
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"β25Feb 21, 2025Updated last year
- A Vision-Language Benchmark for Microscopy Understandingβ32Mar 13, 2025Updated last year
- Active Learning in the era of Foundation Modelsβ12Apr 16, 2025Updated 11 months ago
- [CVPR 2025] Custom Open CLIP repo to train biomedical CLIP modelsβ35Mar 23, 2025Updated last year
- [NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selectionβ21Feb 3, 2024Updated 2 years ago
- β12Mar 18, 2024Updated 2 years ago
- Interactive Continual Semantic Segmentationβ12Apr 13, 2022Updated 3 years ago
- Radiology Language Evaluationsβ11Nov 17, 2023Updated 2 years ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202β¦β40May 26, 2025Updated 9 months ago
- Python interface and preprocessing pipeline for the BBBC021 dataset of cellular imagesβ14Sep 19, 2021Updated 4 years ago
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lecturesβ80Sep 14, 2025Updated 6 months ago
- Fast Pythonic data structures and tools for wrangling medical images.β28Jul 21, 2025Updated 8 months ago
- Neural Network architecture Visualization toolβ13Jul 4, 2020Updated 5 years ago
- β22Nov 27, 2025Updated 3 months ago
- Univariate guided lassoβ24Jan 22, 2026Updated 2 months ago
- [Nature Communications] O2VAE: a model for orientation-invariant representation learning (phenotyping) in cell biology dataβ38Mar 26, 2025Updated 11 months ago
- An official implementation of "GOALβ½: Global-local Object Alignment Learning" (CVPR 2025).β27Aug 14, 2025Updated 7 months ago
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial Viewβ13Jun 5, 2024Updated last year
- Official repository for the ICCV2023 paper "Kick Back & Relax: Learning to Reconstruct the World by Watching SlowTV"β84Mar 5, 2024Updated 2 years ago
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Modelsβ111Dec 3, 2024Updated last year
- β32Oct 6, 2024Updated last year
- π The 2nd Place Submission to the CVPR2021-Evoked Emotion from Videos challenge.β17Jun 7, 2021Updated 4 years ago
- β16Jan 28, 2026Updated last month
- Official Pytorch code for Open World Object Detection in the Era of Foundation Modelsβ94Jan 26, 2024Updated 2 years ago
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records.β98May 17, 2025Updated 10 months ago
- Code and annotation for the paper "Towards Accurate and Interpretable Surgical Skill Assessment: A Video-Based Method Incorporating Recogβ¦β12Jan 20, 2023Updated 3 years ago
- β32Oct 18, 2024Updated last year
- Final project for ECE239AS Deep Learning and Neural Network at UCLA, Winter 2019β11Jul 5, 2024Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)β132Nov 5, 2025Updated 4 months ago
- PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Modelsβ14Jul 20, 2023Updated 2 years ago
- ScienceMeter: Tracking Scientific Knowledge Updates in Language Modelsβ17Jun 28, 2025Updated 8 months ago
- ShapeEmbedLite: a lightweight self-supervised representation learning model for 2D shape analysisβ21Oct 13, 2025Updated 5 months ago
- Multimodal encoder-only transformer model for image-based protein predictionsβ15Dec 12, 2023Updated 2 years ago