njucckevin/KnowCap


njucckevin / KnowCap

Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model

☆13

Alternatives and similar repositories for KnowCap

Users who are interested in KnowCap are comparing it to the repositories listed below.


Liac-li / MM-self-improve-qwen2vl
☆13 · Dec 9, 2024 · Updated last year
feizc / PNAIC
Partially Non-Autoregressive Image Captioning
☆10 · Sep 30, 2021 · Updated 4 years ago
feizc / DeeCap
Dynamic Early Exit for Image Captioning
☆17 · Oct 25, 2022 · Updated 3 years ago
RitaRamo / extra
Retrieval-augmented Image Captioning
☆13 · Feb 16, 2023 · Updated 3 years ago
njucckevin / OpenMobile-Code
The model, data and code for OpenMobile
☆49 · Jul 9, 2026 · Updated last week
delchiaro / RATT
☆18 · Oct 3, 2023 · Updated 2 years ago
xuanlinli17 / large_vlm_distillation_ood
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)
☆60 · Apr 8, 2024 · Updated 2 years ago
starreeze / efuf
The official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo…
☆21 · Apr 9, 2025 · Updated last year
Agchai52 / GAN_Based_Image_Deblurring_Using_Dark_Channel_Prior
This is the official code for the paper "GAN Based Image Deblurring Using Dark Channel Prior"
☆13 · Apr 5, 2019 · Updated 7 years ago
CUMTGG / CIIC
☆18 · Sep 13, 2023 · Updated 2 years ago
boreng0817 / IFCap
[EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
☆15 · May 13, 2025 · Updated last year
PKU-ICST-MIPL / LFR-GAN_TOMM2023
Official repository for "LFR-GAN: Local Feature Refinement based Generative Adversarial Network for Text-to-Image Generation" (TOMM 2023)…
☆10 · Mar 21, 2023 · Updated 3 years ago
gyq716 / Graph2Seq
Graph neural networks, including GAT, GCN, GGNN, and GGNN-LSTM, using scene graphs to generate captions
☆21 · Jul 19, 2019 · Updated 7 years ago
eminorhan / humanlike-vits
ViT models pretrained with up to ~5k hours of human-like video data
☆14 · Aug 10, 2023 · Updated 2 years ago
csjcai / DBCPeNet
Dark and Bright Channel Priors Embedded Network for Dynamic Scene Deblurring
☆24 · Jul 15, 2020 · Updated 6 years ago
bearcatt / LaBERT
A length-controllable and non-autoregressive image captioning model.
☆69 · Jun 10, 2021 · Updated 5 years ago
Lihr747 / CgtGAN
☆20 · May 3, 2025 · Updated last year
aimagelab / PMA-Net
[ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.
☆19 · Jun 7, 2024 · Updated 2 years ago
swaggy-TN / EfficientVLM
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)
☆33 · Jul 18, 2023 · Updated 3 years ago
HAWLYQ / InfoMetIC
☆13 · Sep 5, 2023 · Updated 2 years ago
xszheng2020 / memorization
An Empirical Study of Memorization in NLP (ACL 2022)
☆13 · Jun 22, 2022 · Updated 4 years ago
vyraun / long-tailed
Code for "On Long-Tailed Phenomena in NMT".
☆10 · Jan 10, 2021 · Updated 5 years ago
flauted / coco-caption
☆10 · Dec 28, 2018 · Updated 7 years ago
maroo-sky / FSD
Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring official code
☆11 · Jul 17, 2023 · Updated 3 years ago
junyangwang0410 / HaELM
An automatic MLLM hallucination detection framework
☆19 · Sep 26, 2023 · Updated 2 years ago
aaronma2020 / MSGO
☆16 · Apr 7, 2026 · Updated 3 months ago
chihyaoma / cyclical-visual-captioning
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46 · Jul 29, 2020 · Updated 5 years ago
ahmedssabir / Belief-Revision-Score
Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022
☆11 · Apr 13, 2025 · Updated last year
xu1998hz / SEScore
This repo contains all the code for the SEScore implementation
☆15 · Mar 3, 2025 · Updated last year
ajd12342 / why-winoground-hard
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
☆31 · May 29, 2023 · Updated 3 years ago
libeineu / SDT-Training
The implementation of "Shallow-to-Deep Training for Neural Machine Translation"
☆10 · Oct 26, 2020 · Updated 5 years ago
ohlionel / Prune-Tune
Official code repository for AAAI2021 paper Finding Sparse Structures for Domain Specific Neural Machine Translation
☆11 · Apr 1, 2021 · Updated 5 years ago
ruotianluo / lmdbdict
A simple wrapper for lmdb. Supports dict-like operations.
☆23 · Apr 20, 2023 · Updated 3 years ago
shubhamprshr27 / NeglectedTailsVLM
This repository houses the code for the paper - "The Neglected of VLMs"
☆30 · Dec 31, 2025 · Updated 6 months ago
CONE-MT / Lego-MT
☆10 · Mar 22, 2024 · Updated 2 years ago
XMUDeepLIT / Robust-knn-mt
Code for "Towards Robust k-Nearest-Neighbor Machine Translation" (EMNLP 2022)
☆12 · Oct 18, 2022 · Updated 3 years ago
sudanl / kNN-TL
kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation (ACL2023)
☆11 · Jul 26, 2023 · Updated 2 years ago
zhengxxn / UDA-KNN
☆12 · Aug 31, 2021 · Updated 4 years ago
X-LANCE / weblm
[WSDM 2024] Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
☆18 · Mar 6, 2024 · Updated 2 years ago