jayleicn/VideoLanguageFuturePred

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jayleicn/VideoLanguageFuturePred)

jayleicn / VideoLanguageFuturePred

[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction

☆52

Alternatives and similar repositories for VideoLanguageFuturePred

Users that are interested in VideoLanguageFuturePred are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Deanplayerljx / tab-vcr
View on GitHub
Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671
☆19May 6, 2021Updated 5 years ago
jamespark3922 / visual-comet
View on GitHub
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
☆87Jun 12, 2023Updated 3 years ago
jimmy646 / violin
View on GitHub
Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"
☆161Apr 29, 2020Updated 6 years ago
VALUE-Leaderboard / StarterCode
View on GitHub
Starter Code for VALUE benchmark
☆79Aug 23, 2022Updated 3 years ago
zinengtang / VidLanKD
View on GitHub
Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))
☆56Feb 6, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
airsplay / VisualRelationships
View on GitHub
Data of ACL 2019 Paper "Expressing Visual Relationships via Language".
☆63Sep 30, 2020Updated 5 years ago
jacobswan1 / Video2Commonsense
View on GitHub
Video captioning baseline models on Video2Commonsense Dataset.
☆56Apr 15, 2021Updated 5 years ago
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
zhangybzbo / EnvBiasVLN
View on GitHub
Feature resources of "Diagnosing the Environment Bias in Vision-and-Language Navigation"
☆16May 6, 2020Updated 6 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
View on GitHub
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34May 14, 2020Updated 6 years ago
thaolmk54 / hcrn-videoqa
View on GitHub
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
☆135Jul 25, 2024Updated last year
XgDuan / WSDEC
View on GitHub
Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…
☆104Mar 21, 2020Updated 6 years ago
AnnikaLindh / Diverse_and_Specific_Image_Captioning
View on GitHub
Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…
☆13May 25, 2025Updated last year
zhegan27 / LXMERT-AdvTrain
View on GitHub
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21Oct 20, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
fanchenyou / HME-VideoQA
View on GitHub
Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA
☆55Sep 13, 2021Updated 4 years ago
daicoolb / Awesome-Video-Captioning
View on GitHub
video captioning
☆24Mar 14, 2019Updated 7 years ago
yytzsy / SMCG
View on GitHub
Code for the paper "Controllable Video Captioning with an Exemplar Sentence"
☆12Apr 14, 2021Updated 5 years ago
ruotianluo / GoogleConceptualCaptioning
View on GitHub
☆54Dec 13, 2019Updated 6 years ago
linjieli222 / HERO
View on GitHub
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆235Sep 16, 2021Updated 4 years ago
yj-yu / lsmdc
View on GitHub
☆33Nov 12, 2018Updated 7 years ago
microsoft / LAVENDER
View on GitHub
A Unified Framework for Video-Language Understanding
☆62Jun 17, 2023Updated 3 years ago
allenai / visual-reasoning-rationalization
View on GitHub
Code associated with the "Natural Language Rationales with Full-Stack Visual Reasoning" EMNLP Findings 2020 paper
☆24Jan 15, 2021Updated 5 years ago
jssprz / attentive_specialized_network_video_captioning
View on GitHub
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
☆15Apr 6, 2021Updated 5 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
jayleicn / ClipBERT
View on GitHub
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730Aug 8, 2023Updated 2 years ago
AmingWu / CCN
View on GitHub
Connective Cognition Network for Directional Visual Commonsense Reasoning
☆15May 6, 2021Updated 5 years ago
YuxiXie / V-DPO
View on GitHub
Preference Learning for LLaVA
☆60Nov 9, 2024Updated last year
eric-xw / Video-guided-Machine-Translation
View on GitHub
Starter code for the VMT task and challenge
☆51Jul 29, 2020Updated 5 years ago
jayleicn / TVCaption
View on GitHub
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆91Sep 6, 2023Updated 2 years ago
zzxslp / XL-VLN
View on GitHub
Dataset for Bilingual VLN
☆11Dec 5, 2020Updated 5 years ago
lixiangpengcs / PSAC
View on GitHub
Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering
☆27Apr 15, 2021Updated 5 years ago
chihyaoma / cyclical-visual-captioning
View on GitHub
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46Jul 29, 2020Updated 5 years ago
shengyuzhang / Poet
View on GitHub
Poet: Product-oriented Video Captioner for E-commerce
☆12Sep 21, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
fenglinliu98 / MIA
View on GitHub
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" （NeurIPS 2019）
☆65Oct 19, 2020Updated 5 years ago
cooelf / UVR-NMT
View on GitHub
Neural Machine Translation with universal Visual Representation (ICLR 2020)
☆92Jul 1, 2020Updated 6 years ago
salesforce / BiST
View on GitHub
Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)
☆11Jun 16, 2025Updated last year
ZiyueWu59 / CCA
View on GitHub
☆15Jan 16, 2024Updated 2 years ago
LisaAnne / LocalizingMoments
View on GitHub
Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"
☆198Oct 31, 2020Updated 5 years ago
szq0214 / MSR-VTT-Challenge
View on GitHub
Video to Language Challenge (MSR-VTT Challenge 2016)
☆32Dec 28, 2017Updated 8 years ago
iriscxy / VMSMO
View on GitHub
Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''
☆36Jul 30, 2021Updated 4 years ago