Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"
★20 · Jun 2, 2025 · Updated 11 months ago
Alternatives and similar repositories for SurgBenchKit
Users who are interested in SurgBenchKit are comparing it to the repositories listed below.
- [CVPR 2025] MicroVQA eval and RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"… ★38 · Nov 25, 2025 · Updated 5 months ago
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature ★103 · Mar 22, 2025 · Updated last year
- [ICLR 2025] Video Action Differencing ★53 · Jul 3, 2025 · Updated 10 months ago
- Histopathology Feature Extractors (2024) ★14 · Jun 14, 2024 · Updated last year
- A Vision-Language Benchmark for Microscopy Understanding ★31 · Mar 13, 2025 · Updated last year
- Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M… ★26 · May 12, 2026 · Updated last week
- Active Learning in the era of Foundation Models ★13 · Apr 16, 2025 · Updated last year
- [CVPR 2025] Custom Open CLIP repo to train biomedical CLIP models ★37 · Mar 23, 2025 · Updated last year
- [NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection ★21 · Feb 3, 2024 · Updated 2 years ago
- Interactive Continual Semantic Segmentation ★12 · Apr 13, 2022 · Updated 4 years ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202… ★40 · May 26, 2025 · Updated 11 months ago
- Python interface and preprocessing pipeline for the BBBC021 dataset of cellular images ★14 · Sep 19, 2021 · Updated 4 years ago
- Radiology Language Evaluations ★11 · Nov 17, 2023 · Updated 2 years ago
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures ★80 · Sep 14, 2025 · Updated 8 months ago
- Fast Pythonic data structures and tools for wrangling medical images. ★28 · Jul 21, 2025 · Updated 10 months ago
- Neural Network architecture Visualization tool ★13 · Jul 4, 2020 · Updated 5 years ago
- Univariate guided lasso ★24 · Jan 22, 2026 · Updated 4 months ago
- [Nature Communications] O2VAE: a model for orientation-invariant representation learning (phenotyping) in cell biology data ★39 · Mar 26, 2025 · Updated last year
- Codebase for the paper "HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View" ★13 · Jun 5, 2024 · Updated last year
- Official repository for the ICCV2023 paper "Kick Back & Relax: Learning to Reconstruct the World by Watching SlowTV" ★84 · Mar 5, 2024 · Updated 2 years ago
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models ★111 · Dec 3, 2024 · Updated last year
- An official implementation of "GOAL⚽: Global-local Object Alignment Learning" (CVPR 2025). ★35 · Aug 14, 2025 · Updated 9 months ago
- The 2nd Place Submission to the CVPR2021-Evoked Emotion from Videos challenge. ★17 · Jun 7, 2021 · Updated 4 years ago
- Official Pytorch code for Open World Object Detection in the Era of Foundation Models ★95 · Jan 26, 2024 · Updated 2 years ago
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records. ★99 · May 17, 2025 · Updated last year
- Code and annotation for the paper "Towards Accurate and Interpretable Surgical Skill Assessment: A Video-Based Method Incorporating Recog… ★12 · Jan 20, 2023 · Updated 3 years ago
- Final project for ECE239AS Deep Learning and Neural Network at UCLA, Winter 2019 ★11 · Jul 5, 2024 · Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral) ★133 · Nov 5, 2025 · Updated 6 months ago
- PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models ★14 · Jul 20, 2023 · Updated 2 years ago
- ScienceMeter: Tracking Scientific Knowledge Updates in Language Models ★17 · Jun 28, 2025 · Updated 10 months ago
- ShapeEmbedLite: a lightweight self-supervised representation learning model for 2D shape analysis ★23 · Apr 23, 2026 · Updated last month
- Multimodal encoder-only transformer model for image-based protein predictions ★15 · Dec 12, 2023 · Updated 2 years ago