CUHK-AIM-Group / EndoBenchLinks
[NeurIPS'25] EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
☆49Updated 3 weeks ago
Alternatives and similar repositories for EndoBench
Users that are interested in EndoBench are comparing it to the libraries listed below
Sorting:
- ☆58Updated last year
- Generative Enhancement for 3D Medical Images☆70Updated last year
- OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding☆57Updated 3 months ago
- [NeurIPS 2023] Text Promptable Surgical Instrument Segmentation with Vision-Language Models☆40Updated last year
- [ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆33Updated 3 weeks ago
- This is the official repository for the IEEE TMI paper titled "Large Language Model with Region-Guided Referring and Grounding for CT Rep…☆47Updated 3 months ago
- [ICRA 2025] Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion☆22Updated 4 months ago
- Endora: Video Generation Models as Endoscopy Simulators (MICCAI 2024)☆142Updated 5 months ago
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆82Updated 4 months ago
- Discover the repository for "ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting," a pioneering study that…☆25Updated 10 months ago
- Improved tumor synthesis leveraging radiology reports as prompts for diffusion models.☆34Updated 7 months ago
- The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".☆42Updated 11 months ago
- [NeurIPS 2025][OralGPT & MMOral] Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Digital Dentistry☆33Updated last week
- ☆32Updated 6 months ago
- [IEEE TPAMI 2025] This repository is the official implementation of the paper "VisionUnite: A Vision-Language Foundation Model for Ophtha…☆41Updated last month
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆31Updated 11 months ago
- [TPAMI 2024] Measurement Guidance in Difffusion Models: Insight from Medical Image Synthesis☆53Updated last year
- Official repository for the paper "Prototype Representation Joint Learning from Medical Images and Reports, ICCV 2023".☆74Updated last year
- The official repository of the paper 'Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine'☆95Updated 9 months ago
- Official Implement of the paper "Unifying Segment Anything in Microscopy with Multimodal Large Language Model"☆15Updated last week
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays☆38Updated 4 months ago
- An offcial implementation for UniBrain: Universal Brain MRI Diagnosis with Hierarchical Knowledge-enhanced Pre-training☆31Updated 7 months ago
- [ICCV' 23] MRM: Masked Relation Modeling for Medical Image Pre-Training with Genetics☆10Updated 11 months ago
- [MICCAI 2025 Best Paper Award Runner-up] Learning Segmentation from Radiology Reports☆53Updated this week
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]☆28Updated 3 months ago
- The collection of medical VLP papars☆19Updated last year
- [NeurIPS 2025] Completeness-Aware Reconstruction Enhancement☆26Updated 3 weeks ago
- Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding (ICLR 2025)☆104Updated 6 months ago
- Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach☆15Updated 2 months ago
- ☆22Updated last month