CUHK-AIM-Group / EndoBenchLinks
[NeurIPS'25] EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
☆56Updated 2 months ago
Alternatives and similar repositories for EndoBench
Users that are interested in EndoBench are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆59Updated 6 months ago
- Generative Enhancement for 3D Medical Images☆75Updated last year
- ☆58Updated last year
- [ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆37Updated 3 months ago
- [ICRA 2025] Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion☆22Updated 6 months ago
- Endora: Video Generation Models as Endoscopy Simulators (MICCAI 2024)☆148Updated 8 months ago
- [NeurIPS 2023] Text Promptable Surgical Instrument Segmentation with Vision-Language Models☆43Updated 2 years ago
- Discover the repository for "ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting," a pioneering study that…☆27Updated last year
- The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".☆44Updated last year
- [TPAMI 2024] Measurement Guidance in Difffusion Models: Insight from Medical Image Synthesis☆53Updated last year
- The official repository of the paper 'Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine'☆109Updated last year
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆85Updated 7 months ago
- (AAAI-2025 oral) LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts☆52Updated 6 months ago
- This is the official repository for the IEEE TMI paper titled "Large Language Model with Region-Guided Referring and Grounding for CT Rep…☆65Updated 6 months ago
- Improved tumor synthesis leveraging radiology reports as prompts for diffusion models.☆37Updated 2 weeks ago
- ☆25Updated 3 months ago
- ☆10Updated 2 years ago
- The official code for MedAgent_Pro☆84Updated 4 months ago
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]☆40Updated 6 months ago
- [NeurIPS'25][OralGPT & MMOral] The official repo of OralGPT & MMOral Bench.☆54Updated last week
- This repository contains source code to train and evaluate the vision-centric foundation model CheXFound.☆19Updated 3 months ago
- Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding (ICLR 2025)☆114Updated 9 months ago
- ☆39Updated 2 months ago
- DeepTumorVQA benchmark (9262 CT images + 395k QA pairs)☆30Updated 6 months ago
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays☆44Updated 2 weeks ago
- A framework for Longitudinal Radiology Report Generation☆26Updated last year
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆26Updated 2 months ago
- ☆21Updated 3 weeks ago
- Official Code for our CVPR 2024 Paper "Diversified and Personalized Multi-rater Medical Image Segmentation" (Highlight)☆87Updated last month
- [NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection☆18Updated last year