Official implementation of “CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery”, MICCAI 2023
☆17Jul 7, 2024Updated last year
Alternatives and similar repositories for CAT-ViL
Users that are interested in CAT-ViL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of "Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Ro…☆27Jul 7, 2024Updated last year
- Official implementation of “LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusio…☆21Jul 7, 2024Updated last year
- Official implementation of "EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy", MICCAI 2…☆12Jan 29, 2026Updated 4 months ago
- rendezvous-in-time☆13Sep 17, 2025Updated 8 months ago
- TMI 2023: Less is More: Surgical Phase Recognition from Timestamp Supervision☆22Feb 9, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Simple video summarisation Python package.☆25Jan 29, 2024Updated 2 years ago
- ☆15Nov 19, 2020Updated 5 years ago
- ☆198Feb 27, 2026Updated 3 months ago
- Domain adaptation framework for segmentation via reinforcement learning.☆15Oct 13, 2025Updated 8 months ago
- PyTorch implements `Image Super-Resolution Using Very Deep Residual Channel Attention Networks` paper.☆15Dec 6, 2022Updated 3 years ago
- S2ME: Spatial-Spectral Mutual Teaching and Ensemble Learning for Scribble-supervised Polyp Segmentation (MICCAI 2023)☆21Dec 1, 2023Updated 2 years ago
- This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment…☆37Sep 17, 2025Updated 8 months ago
- ☆10Jun 6, 2024Updated 2 years ago
- A python3 library for evaluating caption's BLEU, Meteor, CIDEr, SPICE,ROUGE_L,WMD score. Fork from https://github.com/ruotianluo/coco-cap…☆22Nov 25, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of FuseMoE for FlexiModal Fusion, NeurIPS'24☆35Mar 26, 2026Updated 2 months ago
- How Much Position Information Do Convolutional Neural Networks Encode?