A Vision-Language Benchmark for Microscopy Understanding
β30Mar 13, 2025Updated 11 months ago
Alternatives and similar repositories for Micro-Bench
Users that are interested in Micro-Bench are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] MicroVQA eval and π€RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"β¦β32Nov 25, 2025Updated 3 months ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"β19Jun 2, 2025Updated 9 months ago
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literatureβ92Mar 22, 2025Updated 11 months ago
- [Nature Communications] O2VAE: a model for orientation-invariant representation learning (phenotyping) in cell biology dataβ38Mar 26, 2025Updated 11 months ago
- [ICLR 2025] Video Action Differencingβ52Jul 3, 2025Updated 7 months ago
- [NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selectionβ21Feb 3, 2024Updated 2 years ago
- Histopathology Feature Extractors (2024)β12Jun 14, 2024Updated last year
- Repository in Support of EAGLE Submissionβ21Oct 11, 2025Updated 4 months ago
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Modelsβ111Dec 3, 2024Updated last year
- Active Learning in the era of Foundation Modelsβ12Apr 16, 2025Updated 10 months ago
- Radiology Language Evaluationsβ11Nov 17, 2023Updated 2 years ago
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)β34Oct 16, 2024Updated last year
- Official Implement of the paper "Unifying Segment Anything in Microscopy with Multimodal Large Language Model"β20Dec 14, 2025Updated 2 months ago
- β17Jul 8, 2024Updated last year
- β15Jul 8, 2024Updated last year
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervisionβ72Jul 10, 2024Updated last year
- β22May 1, 2025Updated 10 months ago
- MorphoFeatures code and dataβ22May 4, 2023Updated 2 years ago
- Official Pytorch code for Open World Object Detection in the Era of Foundation Modelsβ93Jan 26, 2024Updated 2 years ago
- [CVPR 2025] Custom Open CLIP repo to train biomedical CLIP modelsβ34Mar 23, 2025Updated 11 months ago
- Fast Pythonic data structures and tools for wrangling medical images.β28Jul 21, 2025Updated 7 months ago
- Atlas of Digital Pathology for Deep Learning [CVPR2019]β25Jun 22, 2020Updated 5 years ago
- Comprehensive, open-source Whole Slide Image (WSI) datasetβ48Jun 4, 2025Updated 8 months ago
- Tutorial on using Hugging Face's Vision Transformers for Image Classificationβ10Sep 4, 2021Updated 4 years ago
- Hibou: Foundational Models for Pathologyβ76Oct 23, 2024Updated last year
- Whole Slide image (WSI) conversion for brightfield histology imagesβ38Feb 23, 2026Updated last week
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202β¦β40May 26, 2025Updated 9 months ago
- The computational platform u-signal3D defines a shape-invariant representation of the spatial scales of molecular organization at the celβ¦β11Jan 22, 2026Updated last month
- Jupyter notebooks for analysis and figures related to the native organelle IP paperβ13Nov 13, 2025Updated 3 months ago
- Tutorial for Graph Neural Network at APBJC 2024.β10Apr 21, 2025Updated 10 months ago
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answeringβ¦β31Jan 31, 2023Updated 3 years ago
- An interpretable and flexible deep learning framework for single-T cell transcriptome and receptor analysisβ14Apr 5, 2025Updated 10 months ago
- YoloTeeth is a GitHub repository dedicated to leveraging YOLOv8 for precise instance segmentation and object detection in teeth X-ray imaβ¦β12Nov 10, 2024Updated last year
- β15Mar 12, 2024Updated last year
- How to use OpenAI API?β12Nov 23, 2023Updated 2 years ago
- This GitHub provides the source code for the paper "Exploring Facial Expression and Action Units in Parkinson Disease"β10Dec 21, 2022Updated 3 years ago
- β15Sep 26, 2020Updated 5 years ago
- A Multitask Conversational Vision-Language Model for Radiologyβ16Jul 3, 2025Updated 7 months ago
- Whether you're a beginner exploring LangChain or an advanced practitioner building scalable GenAI applications, this tutorial-style projeβ¦β13Feb 10, 2026Updated 2 weeks ago