Deanplayerljx/tab-vcr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Deanplayerljx/tab-vcr)

Deanplayerljx / tab-vcr

Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671

☆19

Alternatives and similar repositories for tab-vcr

Users that are interested in tab-vcr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jayleicn / VideoLanguageFuturePred
View on GitHub
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
☆52Aug 20, 2022Updated 3 years ago
jacobswan1 / Video2Commonsense
View on GitHub
Video captioning baseline models on Video2Commonsense Dataset.
☆56Apr 15, 2021Updated 5 years ago
zhegan27 / VILLA
View on GitHub
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER…
☆119Jan 13, 2021Updated 5 years ago
rowanz / r2c
View on GitHub
Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)
☆469May 6, 2021Updated 5 years ago
jssprz / attentive_specialized_network_video_captioning
View on GitHub
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
☆15Apr 6, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
LibertFan / ImageCaption
View on GitHub
Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning based in ACL2019
☆17Sep 8, 2019Updated 6 years ago
jiasenlu / vilbert_beta
View on GitHub
☆478Nov 21, 2022Updated 3 years ago
AlenUbuntu / Awesome-Vision-and-Language-PreTrain-Papers
View on GitHub
☆14Dec 25, 2020Updated 5 years ago
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
forwchen / HVTG
View on GitHub
Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"
☆17Aug 25, 2020Updated 5 years ago
zzxslp / XL-VLN
View on GitHub
Dataset for Bilingual VLN
☆11Dec 5, 2020Updated 5 years ago
daicoolb / Awesome-Video-Captioning
View on GitHub
video captioning
☆24Mar 14, 2019Updated 7 years ago
jayleicn / mTVRetrieval
View on GitHub
[ACL 2021] mTVR: Multilingual Video Moment Retrieval
☆27Aug 20, 2022Updated 3 years ago
PKU-ICST-MIPL / CKRM_TCSVT2020
View on GitHub
Source code of our TCSVT 2020 paper "Multi-level Knowledge Injecting for Visual Commonsense Reasoning"
☆11Sep 18, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
TheShadow29 / vognet-pytorch
View on GitHub
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
☆69Jun 10, 2020Updated 6 years ago
zjiayao / cvpr17
View on GitHub
CVPR '17 Paper Collection
☆10Jul 17, 2017Updated 9 years ago
VegB / iNLG
View on GitHub
Implementation of "Visualize Before You Write: Imagination-Guided Open-Ended Text Generation".
☆17Feb 3, 2023Updated 3 years ago
jacobswan1 / ViTCAP
View on GitHub
Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".
☆43May 28, 2022Updated 4 years ago
chhwang / cmcl
View on GitHub
This code is for the paper "Confident Multiple Choice Learning".
☆17Aug 4, 2018Updated 7 years ago
lupantech / dual-mfa-vqa
View on GitHub
Co-attending Regions and Detections for VQA.
☆40Jun 2, 2018Updated 8 years ago
airsplay / VisualRelationships
View on GitHub
Data of ACL 2019 Paper "Expressing Visual Relationships via Language".
☆63Sep 30, 2020Updated 5 years ago
eric-xw / Video-guided-Machine-Translation
View on GitHub
Starter code for the VMT task and challenge
☆51Jul 29, 2020Updated 5 years ago
madhawav / MML
View on GitHub
Multi-faceted Video Moment Localizer
☆17Jun 19, 2020Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
NVlabs / PerVLBenchmark
View on GitHub
☆11Jul 31, 2022Updated 3 years ago
noagarcia / knowit-rock
View on GitHub
ROCK model for Knowledge-Based VQA in Videos
☆31Oct 19, 2020Updated 5 years ago
BigRedT / vico
View on GitHub
Multi-sense word embeddings from visual co-occurrences
☆25Sep 5, 2019Updated 6 years ago
Deferf / CLIP_Video_Representation
View on GitHub
Use CLIP to represent video for Retrieval Task
☆71Mar 1, 2021Updated 5 years ago
zongshenmu / attention_knowledge_vqa
View on GitHub
vqa drived by bottom-up and top-down attention and knowledge
☆14Nov 21, 2018Updated 7 years ago
zhegan27 / LXMERT-AdvTrain
View on GitHub
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21Oct 20, 2020Updated 5 years ago
lixiangpengcs / PSAC
View on GitHub
Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering
☆27Apr 15, 2021Updated 5 years ago
djiajunustc / TransVG
View on GitHub
☆198Feb 27, 2024Updated 2 years ago
bearcatt / LaBERT
View on GitHub
A length-controllable and non-autoregressive image captioning model.
☆69Jun 10, 2021Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
runzhouge / MAC
View on GitHub
MAC: Mining Activity Concepts for Language-based Temporal Localization
☆36Nov 26, 2018Updated 7 years ago
asudahkzj / Wnet
View on GitHub
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks
☆24Sep 6, 2022Updated 3 years ago
jwehrmann / lmtd
View on GitHub
Labeled Movie Trailer Dataset
☆16Mar 23, 2018Updated 8 years ago
Gitsamshi / WeakVRD-Captioning
View on GitHub
Implementation of paper "Improving Image Captioning with Better Use of Caption"
☆33Sep 15, 2020Updated 5 years ago
darthgera123 / PanoHDR-NeRF
View on GitHub
Code for Casual Indoor HDR Radiance Capture from Omnidirectional Images. BMVC 22
☆13Dec 16, 2022Updated 3 years ago
jinmang2 / RetroReader
View on GitHub
Implement Retrospective Reader for Machine Reading Comprehension with 🤗 transformers and datasets
☆19Jun 7, 2022Updated 4 years ago
Guaranteer / VidSTG-Dataset
View on GitHub
This repository provides the dataset introduced by the paper "Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentenc…
☆70May 1, 2020Updated 6 years ago