CUMTGG/CIIC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CUMTGG/CIIC)

CUMTGG / CIIC

☆18

Alternatives and similar repositories for CIIC

Users that are interested in CIIC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LX-doctorAI1 / DeltaNet
View on GitHub
☆18Nov 11, 2022Updated 3 years ago
feizc / DeeCap
View on GitHub
Dynamic Early Exit for Image Captioning
☆17Oct 25, 2022Updated 3 years ago
delchiaro / RATT
View on GitHub
☆18Oct 3, 2023Updated 2 years ago
guanghuixu / AnchorCaptioner
View on GitHub
☆30May 7, 2021Updated 5 years ago
njucckevin / KnowCap
View on GitHub
Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
☆13Feb 15, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
gentlefress / MLIP
View on GitHub
The code of paper "MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning" accep…
☆10Mar 5, 2024Updated 2 years ago
Aman-4-Real / See-or-Guess
View on GitHub
[ACM MM 2024] See or Guess: Counterfactually Regularized Image Captioning
☆16Feb 17, 2025Updated last year
liubo105 / SAT
View on GitHub
Improving Medical Vision-Language Contrastive Pretraining with Semantics-aware Triage
☆11Jun 25, 2023Updated 3 years ago
cuhksz-nlp / R2GenCMN
View on GitHub
☆45Jul 31, 2021Updated 4 years ago
XYPB / CLEFT
View on GitHub
Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…
☆18Feb 12, 2025Updated last year
yangyan22 / Medical-Report-Generation-TriNet
View on GitHub
Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation
☆18Nov 13, 2025Updated 8 months ago
uzh-dqbm-cmi / ARGON
View on GitHub
Progressive Transformer-Based Generation of Radiology Reports
☆25Jan 5, 2025Updated last year
CrystalSixone / VLN-GOAT
View on GitHub
Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)
☆103Jun 4, 2025Updated last year
tuyunbin / SCORER
View on GitHub
[ICCV 2023] This is the Pytorch code for our paper "Self-Supervised Cross-View Representation Reconstruction for Change Captioning".
☆20Sep 25, 2025Updated 9 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
mbzuai-oryx / Video-CoM
View on GitHub
Video-CoM: Interactive Video Reasoning via Chain of Manipulations
☆22Jun 17, 2026Updated last month
cheliu-computation / M-FLAG-MICCAI2023
View on GitHub
☆22Aug 1, 2023Updated 2 years ago
NovaMind-Z / PTSN
View on GitHub
Repository for an end-to-end image captioning method PTSN(ACM MM22).
☆60Dec 11, 2022Updated 3 years ago
Gitsamshi / WeakVRD-Captioning
View on GitHub
Implementation of paper "Improving Image Captioning with Better Use of Caption"
☆33Sep 15, 2020Updated 5 years ago
hedongxiao-tju / NSLM
View on GitHub
Code & data accompanying the paper ["Unveiling Implicit Deceptive Patterns in Multi-modal Fake News via Neuro-Symbolic Reasoning"].
☆13Dec 21, 2023Updated 2 years ago
yangxuntu / lxmertcatt
View on GitHub
☆79Oct 8, 2022Updated 3 years ago
SVT-Yang / MedST
View on GitHub
Offical code of Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training[ICML 2024]
☆26May 31, 2024Updated 2 years ago
WissingChen / CMCRL
View on GitHub
The official implementation of “Cross-Modal Causal Representation Learning for Radiology Report Generation” （IEEE T-IP 2025）
☆68May 27, 2025Updated last year
zjukongming / TranSQ
View on GitHub
MICCAI 22 accepted paper “TranSQ: Transformer-based Semantic Query for Medical Report Generation“ for medical report generation
☆27Sep 3, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
LX-doctorAI1 / GSKET
View on GitHub
☆35Nov 22, 2022Updated 3 years ago
ivonajdenkoska / variational-xray-report-gen
View on GitHub
[MICCAI 2021 (Oral)] Official code repository for "Variational Topic Inference for Chest X-Ray Report Generation"
☆21Mar 7, 2022Updated 4 years ago
SinHanYang / Dual-CAN
View on GitHub
Entity-Aware Dual Co-Attention Network for Fake News Detection, EACL 2023 Findings
☆10Jun 11, 2023Updated 3 years ago
calisolo / Levels_image_captioning_NICE
View on GitHub
NICE challenge 2023 Track2 2nd result(total 4th) (CVPR 2023) sponsered by LG AI/Shutterstock/SNU
☆11Jun 22, 2023Updated 3 years ago
ltpwy / MSCI
View on GitHub
☆23May 18, 2025Updated last year
quangvnai / grit
View on GitHub
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
☆199May 9, 2023Updated 3 years ago
YuigaWada / Polos
View on GitHub
[CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
☆33Jun 12, 2026Updated last month
FuxiaoLiu / Twitter-Video-dataset
View on GitHub
[EACL'23] COVID-VTS: Fact Extraction and Verification on Short Video Platforms
☆12Sep 26, 2023Updated 2 years ago
batra-mlp-lab / vln-sim2real-envs
View on GitHub
Code and utilities for creating a Vision-and-Language Navigation (VLN) simulator environment from a physical space.
☆12Nov 10, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Markin-Wang / CLEViT
View on GitHub
[IJCAI 2023] CLE-ViT: Contrastive Learning Encoded Transformer for Ultra-Fine-Grained Visual Categorization.
☆10Nov 3, 2023Updated 2 years ago
zhjohnchan / PTUnifier
View on GitHub
[ICCV-2023] Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts
☆78Mar 22, 2024Updated 2 years ago
HJYao00 / R1-ShareVL
View on GitHub
[NeurIPS 2025] Reasoning MLLM, Share-GRPO, advantage vanishing, sparse reward
☆38Sep 19, 2025Updated 10 months ago
YtongXie / PairAug
View on GitHub
[CVPR2024] PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
☆29Nov 11, 2024Updated last year
Akomand / CausalDiffAE
View on GitHub
Code Repository for CausalDiffAE (ECAI 2024)
☆26Oct 19, 2024Updated last year
husthuaan / AAT
View on GitHub
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Dec 18, 2019Updated 6 years ago
3dlg-hcvc / LAW-VLNCE
View on GitHub
Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments
☆13Nov 29, 2021Updated 4 years ago