cshizhe / eval_capLinks

Improved evaluation codes for common visual captioning metrics.

☆11

Alternatives and similar repositories for eval_cap

Users that are interested in eval_cap are comparing it to the libraries listed below

Sorting:

salanueva / UniVSE
UniVSE implementation on Python3
☆10Updated 4 years ago
zinengtang / DeCEMBERT
Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
☆17Updated 2 years ago
syuqings / video-paragraph
Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021
☆66Updated 3 years ago
jayleicn / VideoLanguageFuturePred
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
☆49Updated 2 years ago
Sy-Zhang / TCMN-Release
Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"
☆14Updated 2 years ago
gujiuxiang / unpaired_image_captioning
Unpaired Image Captioning
☆36Updated 4 years ago
zaynmi / seada-vqa
A pytorch implemetation of data augmentation method for visual question answering
☆21Updated 2 years ago
ShiYaya / emscore
Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26Updated 2 years ago
JaywongWang / CBP
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆60Updated 2 years ago
jayleicn / TVCaption
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆90Updated last year
wangpengnorman / KB-Ref_dataset
☆15Updated 4 years ago
yj-yu / lsmdc
☆32Updated 6 years ago
VALUE-Leaderboard / DataRelease
Data Release for VALUE Benchmark
☆31Updated 3 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34Updated 5 years ago
LisaAnne / TemporalLanguageRelease
☆43Updated 4 years ago
LuoweiZhou / YouCook2-Leaderboard
A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.
☆40Updated 3 years ago
Kien085 / SG2Caps
☆22Updated 3 years ago
zmykevin / UVLP
CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment
☆22Updated 3 years ago
ych133 / How2R-and-How2QA
A video retrieval dataset How2R and a video QA dataset How2QA
☆24Updated 4 years ago
zfchenUnique / VID-Sentence
This repository provides the dataset introduced by our WSSTG paper
☆12Updated 5 years ago
mad-red / VSR-guided-CIC
Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
☆36Updated 3 years ago
jayleicn / mTVRetrieval
[ACL 2021] mTVR: Multilingual Video Moment Retrieval
☆27Updated 2 years ago
daqingliu / awesome-rec
A curated list of research papers in Referring Expression Comprehension (REC)
☆43Updated 4 years ago
jayleicn / TVQAplus
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
☆129Updated 2 years ago
showlab / Region_Learner
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
☆42Updated 3 years ago
TheShadow29 / vognet-pytorch
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
☆67Updated 5 years ago
doc-doc / vRGV
Visual Relation Grounding in Videos (ECCV'20, Spotlight)
☆57Updated 2 years ago
erobic / negative_analysis_of_grounding
Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)
☆23Updated 5 years ago
SpencerWhitehead / novelvqa
☆27Updated 3 years ago
VALUE-Leaderboard / StarterCode
Starter Code for VALUE benchmark
☆80Updated 2 years ago