☆30Sep 16, 2024Updated last year
Alternatives and similar repositories for Cataract-1K
Users that are interested in Cataract-1K are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge☆47May 23, 2025Updated 10 months ago
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆24Jan 6, 2025Updated last year
- ☆55Jun 12, 2025Updated 9 months ago
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆61Jul 5, 2025Updated 8 months ago
- List of surgical tool datasets organised by task.☆172Aug 30, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13Jun 26, 2022Updated 3 years ago
- This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment…☆36Sep 17, 2025Updated 6 months ago
- ☆18Sep 19, 2025Updated 6 months ago
- The official implementation for paper: Vision-Language Models are Strong Noisy Label Detectors☆16Mar 31, 2025Updated 11 months ago
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Offical Implementation of "Surgical-VQA: Visual Question Answ…☆64Mar 27, 2023Updated 3 years ago
- ☆15Nov 28, 2024Updated last year
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆37Mar 8, 2026Updated 3 weeks ago
- Official repository of the GraSP dataset and implemention of TAPIS☆51Dec 31, 2024Updated last year
- [MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures☆80Sep 14, 2025Updated 6 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Segment This Thing is an efficient image segmentation models that uses a biologically-inspired foveated tokenization to reduce inference …☆56Jun 16, 2025Updated 9 months ago
- [CVPR2024] Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation☆19Sep 3, 2024Updated last year
- TheGamesDB.net Repository - An open, online database for video game fans.☆18Mar 20, 2026Updated last week
- ☆58Jul 9, 2025Updated 8 months ago
- ☆16Oct 31, 2023Updated 2 years ago
- Large-scale Self-supervised Pre-training for Endoscopy☆48Jun 11, 2024Updated last year
- Repository of the Mobile Brazilian Retinal Dataset (mBRSET)☆16Jul 8, 2024Updated last year
- [TMI'22]Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation☆23Dec 20, 2022Updated 3 years ago
- ☆45Feb 16, 2026Updated last month
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ViLReF: A Expert Knowledge Enabled Vision-Language Retinal Foundation Model☆23Oct 16, 2024Updated last year
- (CVPR 2023) Official implemention of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos…☆31Apr 2, 2024Updated last year
- Code for continual machine learning using a dynamic memory (Perkonigg et al. Nat Comms 2021)☆16Jul 26, 2023Updated 2 years ago
- [MedIA'25] FLAIR: A Foundation LAnguage-Image model of the Retina for fundus image understanding.☆173Nov 27, 2025Updated 4 months ago
- 【ACM MM 2025】Official Repo for Paper ‘’EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark an…☆64Apr 21, 2025Updated 11 months ago
- RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic Reports☆67Jan 30, 2026Updated last month
- [MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"☆56Mar 2, 2026Updated 3 weeks ago
- ☆10Oct 7, 2023Updated 2 years ago
- Official PyTorch implementation of: "Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in V…☆14Aug 29, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official code of the paper MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environ…☆53Aug 27, 2025Updated 7 months ago
- 简单的pagerank基础上加上稀疏化矩阵化并行化等处理☆12Oct 8, 2019Updated 6 years ago
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"☆47Apr 19, 2024Updated last year
- [ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew …☆13Oct 23, 2024Updated last year
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆13Nov 1, 2025Updated 4 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery☆15Dec 18, 2025Updated 3 months ago