antoine77340 / video_feature_extractor
Easy to use video deep features extractor
☆309Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for video_feature_extractor
- Multi-Modal Transformer for Video Retrieval☆258Updated last month
- Code for the HowTo100M paper☆252Updated 4 years ago
- Video embeddings for retrieval with natural language queries☆336Updated last year
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…☆535Updated 3 weeks ago
- S3D Text-Video model trained on HowTo100M using MIL-NCE☆191Updated 4 years ago
- Video Summarization Dataset, Papers, Codes☆156Updated 6 years ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)☆226Updated last year
- A repository for extract CNN features from videos using pytorch☆69Updated last year
- PyTorch GPU distributed training code for MIL-NCE HowTo100M☆214Updated 2 years ago
- An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"☆338Updated 3 months ago
- ☆188Updated 3 years ago
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)☆143Updated last year
- A Dataset for Grounded Video Description☆159Updated 2 years ago
- Video Grounding and Captioning☆323Updated 3 years ago
- Code for I3D Feature Extraction☆138Updated 5 years ago
- Mixture-of-Embeddings-Experts☆118Updated 4 years ago
- A pytorch-version implementation codes of paper: "BMN: Boundary-Matching Network for Temporal Action Proposal Generation", which is ac…☆291Updated 2 years ago
- I3D Nonlocal ResNets in Pytorch☆246Updated 2 years ago
- The official implementation of G-TAD: Sub-Graph Localization for Temporal Action Detection☆216Updated 3 years ago
- Release of the pretrained S3D Network in PyTorch (ECCV 2018)☆127Updated last year
- Pytorch C3D feature extractor☆130Updated 6 years ago
- Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆230Updated 3 years ago
- PyTorch implementation of the ACCV 2018-AIU2018 paper Video Summarization with Attention☆180Updated 2 years ago
- Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".☆209Updated 4 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆148Updated 5 years ago
- TALL: Temporal Activity Localization via Language Query☆188Updated 6 years ago
- Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"☆190Updated 4 years ago
- Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation☆220Updated 6 months ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆288Updated 2 years ago
- Inflated i3d network with inception backbone, weights transfered from tensorflow☆528Updated 5 months ago