Redaimao / awesome-multimodal-sequence-learningLinks

Reading list for multimodal sequence learning

☆13

Alternatives and similar repositories for awesome-multimodal-sequence-learning

Users that are interested in awesome-multimodal-sequence-learning are comparing it to the libraries listed below

Sorting:

XL2248 / SOV-MAS
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆10Updated 2 years ago
eyalbd2 / PADA
Official code for the paper "PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains".
☆51Updated 3 years ago
zhegan27 / LXMERT-AdvTrain
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21Updated 4 years ago
kugwzk / DiDE
Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”
☆30Updated 2 years ago
wrk226 / pytorch-multimodal_sarcasm_detection
It is the implementation of paper "Multi-Modal Sarcasm Detection in Twitter with Hierarchical Fusion Model"
☆15Updated 2 years ago
fomalhautb / KM-BART
KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation
☆31Updated 3 years ago
yiren-jian / LM-SupCon
[NAACL 2022] Contrastive Learning for Prompt-based Few-shot Language Learners
☆22Updated 2 years ago
RUCAIBox / VDA
Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models
☆16Updated 3 years ago
ictnlp / DSTC8-AVSD
We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…
☆56Updated 2 years ago
xwgeng / SSAN
How Does Selective Mechanism Improve Self-attention Networks?
☆29Updated 4 years ago
Ydongd / prototypical-prompt-verbalizer
☆19Updated 3 years ago
declare-lab / MM-InstructEval
This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimoda…
☆29Updated 4 months ago
RaleLee / DialogueGCN
A preprocessing and training code for DialogueGCN on Dailydialogue and Mastodon dataset. Use Bert base to preprocess the sentences. Based…
☆29Updated 3 years ago
xiaolin1207 / HTTN-master
The code for "Does Head Label Help for Long-Tailed Multi-Label Text Classific"
☆29Updated 4 years ago
NLP2CT / ua-cl-nmt
Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)
☆11Updated 5 years ago
codezakh / exploiting-BERT-thru-translation
[ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation"
☆40Updated 4 years ago
phellonchen / DMRM
DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog
☆24Updated 3 years ago
prdwb / okvqa-release
☆14Updated 4 years ago
RunxinXu / ChildTuning
Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》
☆61Updated 3 years ago
sh0416 / clrcmd
Official Repository for CLRCMD (Appear in ACL2022)
☆42Updated 2 years ago
yiren-jian / NonLing-CSE
[NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings
☆22Updated 2 years ago
YuJungHeo / kbvqa-public
☆39Updated 2 years ago
wutong8023 / Awesome_Few_Shot_Learning
Advances of few-shot learning, especially for NLP applications.
☆30Updated 2 years ago
NLP2CT / Meta-Curriculum
Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation (AAAI 2021)
☆25Updated 3 years ago
ShannonAI / OpenViDial
Code, Models and Datasets for OpenViDial Dataset
☆131Updated 3 years ago
jokieleung / CL-VQA
the implementation of EMNLP 2020 "Learning to Contrast the Counterfactual Samples for Robust Visual Question Answering"
☆15Updated 3 years ago
woojeongjin / FewVLM
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)
☆42Updated 3 years ago
limanling / m2e2
Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)
☆75Updated last year
qcwthu / Lifelong-Fewshot-Language-Learning
The code for lifelong few-shot language learning
☆55Updated 3 years ago
XL2248 / MSCTD
Code and Data for the ACL22 main conference paper "MSCTD: A Multimodal Sentiment Chat Translation Dataset"
☆41Updated 7 months ago