A consistent Med-VQA dataset, C-SLAKE, extended from SLAKE for further consistency assessment.
☆17 · Jan 12, 2024 · Updated 2 years ago
Alternatives and similar repositories for CSLAKE
Users that are interested in CSLAKE are comparing it to the libraries listed below.
- Multigranularity Contrastive cross-modal collaborative Generation (MCG) model for Video QA ☆12 · Dec 13, 2023 · Updated 2 years ago
- Consistency Conditioned Memory Augmented Dynamic Diagnosis Model for Medical Visual Question Answering ☆16 · Jan 12, 2024 · Updated 2 years ago
- Adapter-Enhanced Hierarchical Cross-Modal Pre-training for Lightweight Medical Report Generation ☆15 · Jan 25, 2025 · Updated last year
- Observation Driven Memory Synergistic Planning for Continuous Vision-Language Navigation ☆33 · Jun 14, 2024 · Updated last year
- [The Visual Computer] The official implementation of "Feature Distribution Normalization Network for Multi-View Stereo". ☆14 · Mar 5, 2025 · Updated last year
- ☆13 · Oct 15, 2025 · Updated 6 months ago
- [IEEE JSTARS] The official implementation of "Surface Depth Estimation from Multi-view Stereo Satellite Images with Distribution Contrast… ☆10 · May 16, 2025 · Updated 11 months ago
- The official repository for "One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts" ☆10 · Aug 16, 2024 · Updated last year
- The repo of the paper: Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medic… ☆11 · May 26, 2023 · Updated 2 years ago
- ☆17 · Jul 21, 2022 · Updated 3 years ago
- Code implementation of RP3D-Diag ☆17 · Nov 25, 2024 · Updated last year
- Code for our EMNLP-2022 paper: "Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning" ☆16 · Feb 22, 2023 · Updated 3 years ago
- [IEEE TMI'22] VQAMix: Conditional Triplet Mixup for Medical Visual Question Answering ☆16 · Oct 9, 2022 · Updated 3 years ago
- ☆19 · Oct 13, 2022 · Updated 3 years ago
- [NeurIPS D&B'24] Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection ☆23 · Mar 25, 2026 · Updated last month
- Repository of our accepted CVPR2022 paper "Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-La… ☆28 · Mar 4, 2022 · Updated 4 years ago
- [Science Advances] Demographic Bias of Vision-Language Foundation Models in Medical Imaging ☆21 · Mar 28, 2025 · Updated last year
- ☆39 · Mar 19, 2026 · Updated last month
- PyTorch implementation of video captioning ☆13 · Sep 24, 2017 · Updated 8 years ago
- Medical Knowledge-Based Network For Patient-oriented Visual Question Answering ☆19 · Feb 25, 2023 · Updated 3 years ago
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022) ☆26 · Mar 28, 2023 · Updated 3 years ago
- AIOZ AI - Overcoming Data Limitation in Medical Visual Question Answering (MICCAI 2019) ☆69 · Apr 21, 2026 · Updated last week
- A Layered Memory Network for MovieQA ☆16 · Apr 27, 2018 · Updated 8 years ago
- The first ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue ☆37 · Oct 1, 2024 · Updated last year
- Code for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering ☆29 · May 30, 2025 · Updated 11 months ago
- The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering ☆20 · May 10, 2022 · Updated 3 years ago
- PyTorch code for ROLL, a knowledge-based video story question answering model. ☆21 · Sep 29, 2020 · Updated 5 years ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding" ☆50 · Jan 27, 2025 · Updated last year
- Traces the boundary of a set of points belonging to an aerial LiDAR scan of a building (part). ☆34 · Jan 9, 2023 · Updated 3 years ago
- ☆20 · Nov 25, 2024 · Updated last year
- Deep Multimodal Neural Architecture Search ☆29 · Nov 15, 2020 · Updated 5 years ago
- ☆44 · Oct 20, 2023 · Updated 2 years ago
- ☆35 · Apr 23, 2026 · Updated last week
- ☆68 · Mar 10, 2026 · Updated last month
- [ICMR'21, Best Poster Paper Award] Medical Visual Question Answering with Multi-task Pre-training and Cross-modal Self-attention ☆35 · Dec 15, 2022 · Updated 3 years ago
- ☆18 · Aug 29, 2025 · Updated 8 months ago
- Multiple Meta-model Quantifying for Medical Visual Question Answering (MICCAI 2021) ☆37 · Apr 21, 2026 · Updated last week
- ☆25 · Sep 8, 2017 · Updated 8 years ago
- A framework for Longitudinal Radiology Report Generation ☆28 · Aug 10, 2024 · Updated last year