guilk/KAT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/guilk/KAT)

guilk / KAT

Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"

☆71

Alternatives and similar repositories for KAT

Users that are interested in KAT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hackerchenzhuo / LaKo
View on GitHub
[Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection
☆24Feb 9, 2024Updated 2 years ago
jialinwu17 / MAVEX
View on GitHub
☆30Dec 16, 2022Updated 3 years ago
alirezasalemi7 / DEDR-MM-FiD
View on GitHub
the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering
☆14Aug 22, 2023Updated 2 years ago
AndersonStra / MuKEA
View on GitHub
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
☆101Mar 30, 2023Updated 3 years ago
yuanze-lin / REVIVE
View on GitHub
[NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
☆105Apr 6, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
LinWeizheDragon / Retrieval-Augmented-Visual-Question-Answering
View on GitHub
This is the official repository for Retrieval Augmented Visual Question Answering
☆251Dec 19, 2024Updated last year
cdancette / vqa-cp-leaderboard
View on GitHub
A collections of papers about VQA-CP datasets and their results
☆42Mar 18, 2022Updated 4 years ago
alexandrosXe / A-Simple-Baseline-For-Knowledge-Based-VQA
View on GitHub
Repo for the EMNLP 2023 paper "A Simple Knowledge-Based Visual Question Answering"
☆25Dec 14, 2023Updated 2 years ago
ThalesGroup / ConceptBERT
View on GitHub
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
☆31Apr 30, 2024Updated 2 years ago
microsoft / PICa
View on GitHub
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)
☆88Apr 10, 2022Updated 4 years ago
WebQnA / WebQA_Baseline
View on GitHub
☆32Apr 24, 2024Updated 2 years ago
guilk / VLC
View on GitHub
Research code for "Training Vision-Language Transformers from Captions Alone"
☆33Jul 15, 2022Updated 4 years ago
quangvnai / visdial
View on GitHub
Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)
☆29Aug 5, 2021Updated 4 years ago
open-vision-language / infoseek
View on GitHub
☆78Oct 27, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
e-spaulding / xpo
View on GitHub
☆12Jun 18, 2024Updated 2 years ago
phiyodr / vqaloader
View on GitHub
PyTorch DataLoader for many VQA datasets
☆15Jan 10, 2023Updated 3 years ago
zinengtang / VidLanKD
View on GitHub
Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))
☆56Feb 6, 2023Updated 3 years ago
edchengg / infoseek_eval
View on GitHub
EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions
☆26May 30, 2024Updated 2 years ago
FreedomIntelligence / TRIM
View on GitHub
We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…
☆22Jan 11, 2026Updated 6 months ago
Yushi-Hu / PromptCap
View on GitHub
natual language guided image captioning
☆89Feb 11, 2024Updated 2 years ago
prdwb / okvqa-release
View on GitHub
☆15May 10, 2021Updated 5 years ago
jokieleung / awesome-visual-question-answering
View on GitHub
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Common…
☆672Jul 6, 2023Updated 3 years ago
tejas-gokhale / vqa_mutant
View on GitHub
☆13Feb 14, 2022Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
dali-does / clevr-math
View on GitHub
☆13May 9, 2023Updated 3 years ago
facebookresearch / UniK-QA
View on GitHub
Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering
☆51Aug 2, 2022Updated 3 years ago
webis-de / scidata22-stereo-scientific-text-reuse
View on GitHub
☆11Dec 2, 2024Updated last year
sEhsanTaher / Beheshti-NER
View on GitHub
Beheshti-NER: Persian named entity recognition Using BERT
☆14May 16, 2021Updated 5 years ago
limanling / KnowledgeVL-Reading
View on GitHub
☆67Jun 18, 2023Updated 3 years ago
CristianCristanchoT / Hedwig-IA
View on GitHub
This is a tool for automatic image labeling using natural language
☆14Nov 27, 2023Updated 2 years ago
miguelsvasco / gmc
View on GitHub
Official Implementation of "Geometric Multimodal Contrastive Representation Learning" (https://arxiv.org/abs/2202.03390)
☆28Jan 6, 2025Updated last year
Gordon-Guojun-Zhang / Transferability-NeurIPS2021
View on GitHub
This repo includes our code for evaluating and improving transferability in domain generalization (NeurIPS 2021)
☆13Nov 1, 2022Updated 3 years ago
tetherless-world / explanation-ontology
View on GitHub
Explanation Ontology Resource website
☆12Jun 8, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tuhinjubcse / MetaphorGenNAACL2021
View on GitHub
Code for MERMAID : Metaphor Generation with Symbolism and Discriminative Decoding
☆11May 2, 2022Updated 4 years ago
BierOne / Attention-Faithfulness
View on GitHub
[ICML 2022] This is the pytorch implementation of "Rethinking Attention-Model Explainability through Faithfulness Violation Test" (https:…
☆20Jul 21, 2022Updated 4 years ago
gangiswag / infogent
View on GitHub
☆24Mar 1, 2025Updated last year
jlian2 / mucko
View on GitHub
Pytorch Implementation of MUCKO(2020 IJCAI)
☆20Oct 25, 2020Updated 5 years ago
raeidsaqur / mgn
View on GitHub
Multimodal Graph Network (MGN): Code repo, examples from the paper
☆25Apr 30, 2021Updated 5 years ago
baaaad / ECE
View on GitHub
[ECCV'22 Poster] Explicit Image Caption Editing
☆22Nov 30, 2022Updated 3 years ago
laihuiyuan / mFLAG
View on GitHub
Multi-Figurative Language Generation (COLING 2022)
☆12Jan 30, 2023Updated 3 years ago