Event Sequence Generation Network
☆14Jun 22, 2021Updated 4 years ago
Alternatives and similar repositories for ESGN
Users that are interested in ESGN are comparing it to the libraries listed below.
- Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)☆75Aug 25, 2021Updated 4 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Jun 11, 2019Updated 6 years ago
- Implementation for "Joint Event Detection and Description in Continuous Video Streams"☆23Nov 4, 2020Updated 5 years ago
- Feature Extraction Toolbox from CUHK&ETHZ&SIAT submission to ActivityNet 2016☆32Mar 31, 2019Updated 7 years ago
- ☆192Jun 16, 2025Updated 9 months ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆152Jul 8, 2019Updated 6 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"☆34Jan 6, 2019Updated 7 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning☆171Dec 4, 2020Updated 5 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- Codes of AAAI 2020 paper "What Makes A Good Story? Designing Composite Rewards for Visual Storytelling"☆27May 31, 2021Updated 4 years ago
- SODA: Story Oriented Dense Video Captioning Evaluation Framework☆14May 3, 2024Updated last year
- Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos☆28Dec 8, 2023Updated 2 years ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)☆230Apr 8, 2023Updated 3 years ago
- A curated list of research papers in Video Captioning☆121Jan 5, 2021Updated 5 years ago
- Graph Convolutional Networks for Temporal Action Localization (ICCV2019)☆323Jul 4, 2020Updated 5 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago
- [ACM MM 2024 (Oral)] Official PyTorch Implementation of Paper "MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement"☆11Dec 30, 2024Updated last year
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision☆46Jul 29, 2020Updated 5 years ago
- Dense video captioning in PyTorch☆41Aug 30, 2019Updated 6 years ago
- TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks (ICCVW 2021)☆117Sep 16, 2023Updated 2 years ago
- ☆12Feb 2, 2024Updated 2 years ago
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)☆69Jun 10, 2020Updated 5 years ago
- ☆11Mar 23, 2026Updated 3 weeks ago
- ☆10Oct 7, 2023Updated 2 years ago
- [ACMMM 2022] ReCoRo: Region-Controllable Robust Light Enhancement by User-Specified Imprecise Masks☆15Feb 6, 2023Updated 3 years ago
- Source code for "Recurrent Fusion Network for Image Captioning".☆23Nov 24, 2018Updated 7 years ago
- Tensorflow Implementation of the Paper "SST: Single-Stream Temporal Action Proposals" in CVPR 2017.☆48Aug 20, 2018Updated 7 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- ☆13Jun 26, 2022Updated 3 years ago
- A Pytorch Implementation of paper: "Neural Color Operators for Sequential Image Retouching", ECCV 2022☆10Oct 25, 2022Updated 3 years ago
- PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation☆12Dec 28, 2019Updated 6 years ago
- Language Models Can See: Plugging Visual Controls in Text Generation☆258Jun 1, 2022Updated 3 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆23Jul 16, 2025Updated 8 months ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 4 years ago
- This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.☆13Aug 10, 2023Updated 2 years ago
- RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization (WACV 2021)☆10Jul 28, 2021Updated 4 years ago
- code for composite in situ imaging (cisi) analysis☆12Oct 26, 2020Updated 5 years ago
- This is my attempt at the ActivityNet Challenge 2017. Thanks to the organizers for providing the boilerplate code and annotated datasets.…☆10Jul 19, 2017Updated 8 years ago