LuoweiZhou / anet2016-cuhk-feature
Feature Extraction Toolbox from CUHK&ETHZ&SIAT submission to ActivityNet 2016
☆32Mar 31, 2019Updated 6 years ago
Alternatives and similar repositories for anet2016-cuhk-feature
Users that are interested in anet2016-cuhk-feature are comparing it to the libraries listed below
- ☆191Jun 16, 2025Updated 8 months ago
- Detectron for image/video region feature extraction, inspired by Xinlei's repo☆22Nov 21, 2020Updated 5 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆152Jul 8, 2019Updated 6 years ago
- Event Sequence Generation Network☆14Jun 22, 2021Updated 4 years ago
- Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)☆75Aug 25, 2021Updated 4 years ago
- Video Grounding and Captioning☆332Oct 12, 2021Updated 4 years ago
- Preprocess the activityNet dataset for detection task☆13Mar 3, 2017Updated 8 years ago
- Action Recognition Toolbox for CUHK&ETHZ&SIAT submission to ActivityNet 2016☆251Nov 19, 2018Updated 7 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Jun 11, 2019Updated 6 years ago
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision☆46Jul 29, 2020Updated 5 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆48Jun 22, 2024Updated last year
- ☆13Nov 19, 2020Updated 5 years ago
- Egocentric Video Description based on Temporally-Linked Sequences☆11Jul 17, 2017Updated 8 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 5 years ago
- Easy to use video deep features extractor☆323Jul 5, 2020Updated 5 years ago
- ☆13Jun 26, 2022Updated 3 years ago
- A Fast PyTorch implementation for ICCV 19 paper "BMN: Boundary-Matching Network for Temporal Action Proposal Generation"☆10Jul 29, 2019Updated 6 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning☆171Dec 4, 2020Updated 5 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Nov 28, 2016Updated 9 years ago
- ☆16Dec 20, 2018Updated 7 years ago
- Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…☆104Mar 21, 2020Updated 5 years ago
- Extract video feature from C3D pretrained on Sports-1M and Kinetics☆15Jul 2, 2019Updated 6 years ago
- Official python implementation of R3-Transformer☆15Nov 30, 2020Updated 5 years ago
- PyTorch demo code for "Spatial-Temporal Pyramid Based Convolutional Neural Network for Action Recognition"☆15Oct 17, 2018Updated 7 years ago
- SST: Single-Stream Temporal Action Proposals (Official Repo)☆100Dec 8, 2022Updated 3 years ago
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- Official code and data for EMNLP 2020 paper "Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attenti…☆21Nov 27, 2020Updated 5 years ago
- A curated list of research papers in Video Captioning☆121Jan 5, 2021Updated 5 years ago
- S3D Text-Video model trained on HowTo100M using MIL-NCE☆200Jul 3, 2020Updated 5 years ago
- ☆20Oct 18, 2021Updated 4 years ago
- Graph Convolutional Networks for Temporal Action Localization (ICCV2019)☆323Jul 4, 2020Updated 5 years ago
- Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).☆48Mar 15, 2023Updated 2 years ago
- Official code implementation of the paper "AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?"☆28Sep 23, 2024Updated last year
- video captioning☆24Mar 14, 2019Updated 6 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- Implementation of "Temporal Recurrent Networks for Online Action Detection"☆23May 6, 2019Updated 6 years ago
- Out-of-Distribution Detection for Generalized Zero-Shot Action Recognition☆54Jul 8, 2019Updated 6 years ago