Event Sequence Generation Network
☆14Jun 22, 2021Updated 4 years ago
Alternatives and similar repositories for ESGN
Users that are interested in ESGN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)☆75Aug 25, 2021Updated 4 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Jun 11, 2019Updated 6 years ago
- Implementation for "Joint Event Detection and Description in Continuous Video Streams"☆23Nov 4, 2020Updated 5 years ago
- Feature Extraction Toolbox from CUHKÐZ&SIAT submission to ActivityNet 2016☆32Mar 31, 2019Updated 6 years ago
- ☆192Jun 16, 2025Updated 9 months ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆152Jul 8, 2019Updated 6 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"☆34Jan 6, 2019Updated 7 years ago
- A simple and effective feature extractor for untrimmed videos☆13Sep 1, 2022Updated 3 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning☆171Dec 4, 2020Updated 5 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- Codes of AAAI 2020 paper "What Makes A Good Story? Designing Composite Rewards for Visual Storytelling"☆27May 31, 2021Updated 4 years ago
- Video Grounding and Captioning☆332Oct 12, 2021Updated 4 years ago
- SODA: Story Oriented Dense Video Captioning Evaluation Framework☆14May 3, 2024Updated last year
- Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos☆28Dec 8, 2023Updated 2 years ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)☆230Apr 8, 2023Updated 2 years ago
- Code for AAAI2020 paper "Fast Learning of Temporal Action Proposal via Dense Boundary Generator"☆353Apr 7, 2023Updated 2 years ago
- A curated list of research papers in Video Captioning☆121Jan 5, 2021Updated 5 years ago
- Graph Convolutional Networks for Temporal Action Localization (ICCV2019)☆323Jul 4, 2020Updated 5 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago
- [ACM MM 2024 (Oral)] Official PyTorch Implementation of Paper "MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement"☆11Dec 30, 2024Updated last year
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision☆46Jul 29, 2020Updated 5 years ago
- Dense video captioning in PyTorch☆41Aug 30, 2019Updated 6 years ago
- ☆12Feb 2, 2024Updated 2 years ago
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)☆69Jun 10, 2020Updated 5 years ago
- ☆10Oct 16, 2025Updated 5 months ago
- ☆10Oct 7, 2023Updated 2 years ago
- [ACMMM 2022] ReCoRo: Region-Controllable Robust Light Enhancement by User-Specified Imprecise Masks☆15Feb 6, 2023Updated 3 years ago
- Source code for "Recurrent Fusion Network for Image Captioning".☆23Nov 24, 2018Updated 7 years ago
- Optical flow extraction tool using OpenCV☆16Dec 29, 2016Updated 9 years ago
- Tensorflow Implementation of the Paper "SST: Single-Stream Temporal Action Proposals" in CVPR 2017.☆48Aug 20, 2018Updated 7 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- ☆13Jun 26, 2022Updated 3 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Jul 16, 2025Updated 8 months ago
- Language Models Can See: Plugging Visual Controls in Text Generation☆258Jun 1, 2022Updated 3 years ago
- Code for our paper in ACL 2017☆13Dec 14, 2017Updated 8 years ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 4 years ago
- This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.☆13Aug 10, 2023Updated 2 years ago
- ☆15Nov 19, 2020Updated 5 years ago