Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26Oct 20, 2022Updated 3 years ago
Alternatives and similar repositories for emscore
Users that are interested in emscore are comparing it to the libraries listed below
Sorting:
- ☆11Sep 7, 2020Updated 5 years ago
- Scene Graph Parsing as Dependency Parsing☆41May 22, 2019Updated 6 years ago
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆65Jul 29, 2025Updated 7 months ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆82Jul 17, 2020Updated 5 years ago
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence"☆12Apr 14, 2021Updated 4 years ago
- This is the implementation of self-CIDEr and LSA-based diversity metrics (only for python 2.7).☆36Feb 26, 2022Updated 4 years ago
- Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021☆66Oct 21, 2021Updated 4 years ago
- Tensorflow Reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video"☆17Nov 21, 2022Updated 3 years ago
- ☆19Nov 22, 2022Updated 3 years ago
- Controllable mage captioning model with unsupervised modes☆21Apr 14, 2023Updated 2 years ago
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision☆46Jul 29, 2020Updated 5 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- A unified framework to jointly model images, text, and human attention traces.☆79May 24, 2021Updated 4 years ago
- CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training☆34Nov 9, 2021Updated 4 years ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆116Sep 15, 2022Updated 3 years ago
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment☆22Apr 15, 2022Updated 3 years ago
- Human-like Controllable Image Captioning with Verb-specific Semantic Roles.☆36Mar 11, 2022Updated 3 years ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- This repository contains the main baselines introduced in WSSTG (ACL 2019).☆56Jul 8, 2024Updated last year
- [AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps☆24Mar 29, 2023Updated 2 years ago
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Oct 15, 2020Updated 5 years ago
- Learning Situation Hyper-Graphs for Video Question Answering☆22Feb 16, 2024Updated 2 years ago
- ☆24Apr 4, 2022Updated 3 years ago
- [AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning☆68Feb 16, 2024Updated 2 years ago
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)☆42May 13, 2022Updated 3 years ago
- A length-controllable and non-autoregressive image captioning model.☆69Jun 10, 2021Updated 4 years ago
- End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)☆229Jan 3, 2024Updated 2 years ago
- ☆11Feb 5, 2021Updated 5 years ago
- PyTorch code for the Findings of EMNLP 2021 paper "Does Vision-and-Language Pretraining Improve Lexical Grounding?"☆11Sep 26, 2021Updated 4 years ago
- fork from https://github.com/jwyang/faster-rcnn.pytorch☆10Aug 6, 2018Updated 7 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- A curated list of Multimodal Captioning related research(including image captioning, video captioning, and text captioning)☆113Jun 6, 2022Updated 3 years ago
- CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022☆29Dec 1, 2022Updated 3 years ago
- Code for "Counterfactual Variable Control for Robust and Interpretable Question Answering"☆14Oct 13, 2020Updated 5 years ago
- SODA: Story Oriented Dense Video Captioning Evaluation Framework☆13May 3, 2024Updated last year
- ☆10Oct 28, 2021Updated 4 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆51Dec 18, 2019Updated 6 years ago
- Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…☆59Mar 24, 2023Updated 2 years ago