salanueva / UniVSELinks

UniVSE implementation on Python3

☆10

Alternatives and similar repositories for UniVSE

Users that are interested in UniVSE are comparing it to the libraries listed below

Sorting:

cshizhe / eval_cap
Improved evaluation codes for common visual captioning metrics.
☆11Updated 3 years ago
Sy-Zhang / TCMN-Release
Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"
☆14Updated 2 years ago
zinengtang / DeCEMBERT
Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
☆17Updated 2 years ago
cdancette / detect-shortcuts
Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering
☆27Updated last year
jayleicn / VideoLanguageFuturePred
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
☆49Updated 2 years ago
syuqings / video-paragraph
Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021
☆66Updated 3 years ago
MILVLG / rosita
ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
☆56Updated 2 years ago
zmykevin / UVLP
CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment
☆22Updated 3 years ago
woojeongjin / FewVLM
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)
☆42Updated 3 years ago
mad-red / VSR-guided-CIC
Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
☆36Updated 3 years ago
gujiuxiang / unpaired_image_captioning
Unpaired Image Captioning
☆36Updated 4 years ago
zmykevin / UC2
CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
☆34Updated 3 years ago
SpencerWhitehead / novelvqa
☆27Updated 3 years ago
VALUE-Leaderboard / DataRelease
Data Release for VALUE Benchmark
☆31Updated 3 years ago
hardyqr / HAL
[AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".
☆38Updated last year
StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Updated 5 years ago
wenhuchen / Meta-Module-Network
Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"
☆43Updated 4 years ago
yj-yu / lsmdc
☆32Updated 6 years ago
Xun-Yang / Causal_Video_Moment_Retrieval
The codes and features of the re-implementation of SIGIR 2021 work "Deconfounded Video Moment Retrieval with Causal Intervention"
☆34Updated 3 years ago
jialinwu17 / MAVEX
☆30Updated 2 years ago
JaywongWang / CBP
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆60Updated 2 years ago
thunlp / VisualDS
☆26Updated 3 years ago
jayleicn / mTVRetrieval
[ACL 2021] mTVR: Multilingual Video Moment Retrieval
☆27Updated 2 years ago
ShiYaya / emscore
Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26Updated 2 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34Updated 5 years ago
showlab / Region_Learner
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
☆42Updated 3 years ago
zaynmi / seada-vqa
A pytorch implemetation of data augmentation method for visual question answering
☆21Updated 2 years ago
sairin1202 / Commonsense-Knowledge-Aware-Concept-Selection-For-Diverse-and-Informative-Visual-Storytelling
The implement of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling
☆11Updated 3 years ago
Huntersxsx / RaNet
source code of our RaNet in EMNLP 2021
☆30Updated 3 years ago
yanxinzju / CSS-VQA
Counterfactual Samples Synthesizing for Robust VQA
☆78Updated 2 years ago