[NeurIPS'25] EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
☆58 · Mar 19, 2026 · Updated last week
Alternatives and similar repositories for EndoBench
Users that are interested in EndoBench are comparing it to the libraries listed below.
- [ICRA 2025] Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion ☆24 · Feb 5, 2026 · Updated last month
- [ICASSP2025] ConcealGS: Conceal Implicit Information in 3D Gaussian Splatting ☆20 · Jan 22, 2025 · Updated last year
- Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision Language Models ☆20 · Oct 12, 2025 · Updated 5 months ago
- ☆45 · Feb 16, 2026 · Updated last month
- [MICCAI'24] GBT: Geometric-oriented Brain Transformer for Autism Diagnosis ☆14 · Sep 19, 2024 · Updated last year
- ☆54 · Jun 12, 2025 · Updated 9 months ago
- basically all the things I used for this article ☆25 · Jan 8, 2025 · Updated last year
- FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing ☆17 · Mar 4, 2025 · Updated last year
- ☆35 · Mar 11, 2025 · Updated last year
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding" ☆61 · Jul 5, 2025 · Updated 8 months ago
- ☆10 · Oct 7, 2023 · Updated 2 years ago
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities ☆32 · Apr 6, 2025 · Updated 11 months ago
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures ☆80 · Sep 14, 2025 · Updated 6 months ago
- ☆13 · Dec 17, 2024 · Updated last year
- ControlPolypNet: Towards Controlled Colon Polyp Synthesis for Improved Polyp Segmentation ☆18 · Aug 19, 2024 · Updated last year
- Undistorted Depth Support for ScanNet++ ☆17 · Dec 8, 2023 · Updated 2 years ago
- ☆16 · Jul 5, 2021 · Updated 4 years ago
- DeepCA: Deep Learning-based 3D Coronary Artery Tree Reconstruction from Two 2D Non-simultaneous X-ray Angiography Projections ☆20 · Jun 21, 2025 · Updated 9 months ago
- [TMI25] EndoGaussian: Real-time Gaussian Splatting for Dynamic Endoscopic Scene Reconstruction ☆189 · Apr 7, 2025 · Updated 11 months ago
- MTTM: Metamorphic Testing for Textual Content Moderation Software ☆32 · Feb 10, 2023 · Updated 3 years ago
- [CVPR 2022] CLIMS: Cross Language Image Matching for Weakly Supervised Semantic Segmentation ☆139 · Jun 7, 2024 · Updated last year
- This is a joint project between Helmholtz Imaging (located at DKFZ) and Lin Yang and Otmar Schmid (Helmholtz Munich). ☆12 · Nov 6, 2024 · Updated last year
- ☆13 · Jan 9, 2025 · Updated last year
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering ☆46 · Oct 15, 2025 · Updated 5 months ago
- [MedIA'22] Generative myocardial motion tracking via latent space exploration with biomechanics-informed prior ☆11 · Apr 3, 2024 · Updated last year
- Code for "EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels." ☆20 · Nov 14, 2024 · Updated last year
- There are compilations of surgery-related tasks, datasets, and papers. ☆157 · Nov 9, 2025 · Updated 4 months ago
- [CVPR'25] MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models ☆63 · May 27, 2025 · Updated 9 months ago
- ☆44 · Jan 19, 2026 · Updated 2 months ago
- [NeurIPS 2025] ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World ☆128 · Aug 18, 2024 · Updated last year
- [MICCAI 2025 Oral] Code for "EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-training." ☆26 · May 15, 2025 · Updated 10 months ago
- CNN Based Image Retrieval. SoTu ☆12 · Jan 11, 2024 · Updated 2 years ago
- ☆75 · Jan 22, 2024 · Updated 2 years ago
- [IEEE JBHI'25] Improving Foundation Model for Endoscopy Video Analysis via Representation Learning on Long Sequences ☆25 · Oct 11, 2025 · Updated 5 months ago
- [NeurIPS 25 Spotlight] VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection ☆62 · Oct 16, 2025 · Updated 5 months ago
- [MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train ☆219 · Oct 11, 2025 · Updated 5 months ago
- SurgLaVi: Official repository ☆28 · Mar 4, 2026 · Updated 3 weeks ago
- [WACV 2026] An extremely simple method for validation-free efficient adaptation of CLIP-like VLMs that is robust to the learning rate. ☆32 · Apr 17, 2025 · Updated 11 months ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation ☆12 · Dec 5, 2025 · Updated 3 months ago