yangbang18/MultiCapCLIP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yangbang18/MultiCapCLIP)

yangbang18 / MultiCapCLIP

(ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning

☆36

Alternatives and similar repositories for MultiCapCLIP

Users that are interested in MultiCapCLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

liupeng0606 / clip4caption
View on GitHub
The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)
☆16Jan 2, 2023Updated 3 years ago
ylqi / GL-RG
View on GitHub
The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".
☆18May 10, 2023Updated 3 years ago
Sejong-VLI / V2T-Action-Graph-JKSUCIS-2023
View on GitHub
The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).
☆14Mar 29, 2023Updated 3 years ago
Adit31 / Captionomaly-Deep-Learning-Toolbox-for-Anomaly-Captioning
View on GitHub
Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos
☆13Jun 26, 2023Updated 3 years ago
aimagelab / pacscore
View on GitHub
[CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation
☆66Jul 29, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
bladewaltz1 / ModeCap
View on GitHub
Controllable mage captioning model with unsupervised modes
☆21Apr 14, 2023Updated 3 years ago
dhg-wei / DeCap
View on GitHub
ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning
☆144Mar 16, 2023Updated 3 years ago
yytzsy / SMCG
View on GitHub
Code for the paper "Controllable Video Captioning with an Exemplar Sentence"
☆12Apr 14, 2021Updated 5 years ago
yangbang18 / CARE
View on GitHub
(TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
☆32Dec 26, 2024Updated last year
ShiYaya / emscore
View on GitHub
Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26Oct 20, 2022Updated 3 years ago
sooperset / boss
View on GitHub
Bayesian Optimization Meets Self-Distillation, ICCV 2023
☆10Aug 28, 2023Updated 2 years ago
wzk1015 / CNMT
View on GitHub
[AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
☆24Mar 29, 2023Updated 3 years ago
princetonvisualai / SPICE-U
View on GitHub
☆11Sep 7, 2020Updated 5 years ago
AI-in-Health / Patient-Instructions
View on GitHub
[NeurIPS 2022] Code for "Retrieve, Reason, and Refine: Generating Accurate and Faithful Discharge/Patient Instructions"
☆36Jul 28, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
glicerico / SGNN
View on GitHub
Implementation of Self-Governing Neural Networks for speech act classification
☆12Updated this week
gsoykan / comics_text_plus
View on GitHub
Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"
☆26Jul 10, 2023Updated 3 years ago
microsoft / SwinBERT
View on GitHub
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
☆250May 26, 2022Updated 4 years ago
dahuang37 / show-and-tell-image-captioning
View on GitHub
Repo for reproducing show and tell: neural image captioning
☆11Dec 12, 2018Updated 7 years ago
VIStA-H / GPT-4V_Social_Media
View on GitHub
GPT-4V(ision) as A Social Media Analysis Engine
☆39Dec 20, 2024Updated last year
onealwj / MVLT
View on GitHub
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆29Nov 11, 2022Updated 3 years ago
bladewaltz1 / PromptSwitch
View on GitHub
☆30Aug 14, 2023Updated 2 years ago
CyberAgentAILab / webcolor
View on GitHub
Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.
☆22Dec 7, 2023Updated 2 years ago
ml-jku / semantic-image-text-alignment
View on GitHub
☆25Jul 10, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
microsoft / react
View on GitHub
REACT (CVPR 2023, Highlight 2.5%)
☆141Apr 7, 2023Updated 3 years ago
csguoh / KD-LTR
View on GitHub
[MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"
☆16Nov 3, 2023Updated 2 years ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
View on GitHub
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18Sep 17, 2021Updated 4 years ago
TencentARC / FLM
View on GitHub
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
☆31May 15, 2023Updated 3 years ago
ezeli / InSentiCap_model
View on GitHub
A pytorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral).
☆11Jul 18, 2022Updated 4 years ago
joeyz0z / ConZIC
View on GitHub
Official implementation of "ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing"
☆76Sep 20, 2023Updated 2 years ago
zhuang-li / FactualSceneGraph
View on GitHub
[ACL 2023 Findings] FACTUAL dataset, the textual scene graph parser trained on FACTUAL.
☆131Jun 15, 2026Updated last month
yafuly / SyntacticGen
View on GitHub
☆16Jul 11, 2023Updated 3 years ago
amazon-science / textadain-robust-recognition
View on GitHub
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21Jul 26, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
shubhamprshr27 / NeglectedTailsVLM
View on GitHub
This repository houses the code for the paper - "The Neglected of VLMs"
☆30Dec 31, 2025Updated 6 months ago
navervision / lincir
View on GitHub
Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)
☆148Jan 5, 2026Updated 6 months ago
zzyhlyoko / DCTC
View on GitHub
☆42Sep 2, 2023Updated 2 years ago
TerminologyHub / termhub-in-5-minutes
View on GitHub
Developer project for getting basic API integrations working in under 5 minutes
☆11May 22, 2026Updated last month
cg1177 / Recursive-Multimodal-Agent
View on GitHub
☆19Jul 1, 2026Updated 2 weeks ago
FudanDISC / ReForm-Eval
View on GitHub
An benchmark for evaluating the capabilities of large vision-language models (LVLMs)
☆46Nov 17, 2023Updated 2 years ago
zchoi / VCRN
View on GitHub
☆11Jul 11, 2023Updated 3 years ago