Video Feature Extractor for S3D-HowTo100M
☆29Apr 30, 2021Updated 4 years ago
Alternatives and similar repositories for VideoFeatureExtractor
Users that are interested in VideoFeatureExtractor are comparing it to the libraries listed below
Sorting:
- code for downloading videos from HowTo100M dataset☆17May 13, 2021Updated 4 years ago
- ☆15May 23, 2023Updated 2 years ago
- An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"☆365Jul 25, 2024Updated last year
- Easy to use video deep features extractor☆322Jul 5, 2020Updated 5 years ago
- Repository for Multimodal AutoML Benchmark☆66Dec 7, 2021Updated 4 years ago
- [ICIP 2022 oral] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning☆28Jun 28, 2023Updated 2 years ago
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Apr 6, 2021Updated 4 years ago
- ☆23Dec 16, 2022Updated 3 years ago
- Animals3D: Learning Articulated Shape with Keypoint Pseudo-labels from Web Images (CVPR 2023)☆14May 20, 2024Updated last year
- ESIM model with lanuage model☆27Nov 10, 2018Updated 7 years ago
- Code for the HowTo100M paper☆298Mar 10, 2020Updated 6 years ago
- The implementation of our CIKM 2021 paper titled as: "Cross-Market Product Recommendation"☆20Nov 30, 2021Updated 4 years ago
- ☆24Feb 15, 2022Updated 4 years ago
- Cross-model active contrastive coding☆22Mar 17, 2021Updated 5 years ago
- Python implementation of extraction of several visual features representations from videos☆23Jul 19, 2021Updated 4 years ago
- 华为digix 2021 赛题1☆29Nov 10, 2021Updated 4 years ago
- ☆21Feb 18, 2022Updated 4 years ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.☆43Apr 17, 2023Updated 2 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"☆28Feb 22, 2022Updated 4 years ago
- The Code for ICME2019 Grand Challenge: Short Video Understanding (Single Model Ranks 6th)☆91Sep 1, 2019Updated 6 years ago
- ☆22Jun 6, 2020Updated 5 years ago
- 2019中国高校计算机大赛——大数据挑战赛 第三名解决方案☆122Feb 16, 2020Updated 6 years ago
- Text-Image Relationships (ACL 2019)☆22Sep 15, 2023Updated 2 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- ☆37Sep 23, 2021Updated 4 years ago
- Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*☆30Apr 16, 2021Updated 4 years ago
- ☆30Mar 2, 2023Updated 3 years ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 3 months ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- smplify code for point cloud based HMR☆10Jan 11, 2022Updated 4 years ago
- ☆14Dec 25, 2020Updated 5 years ago
- [TMLR] Unsupervised Network Embedding Beyond Homophily (https://arxiv.org/abs/2203.10866) Resources☆11Mar 21, 2023Updated 2 years ago
- ☆62Jun 16, 2023Updated 2 years ago
- ☆10Oct 7, 2023Updated 2 years ago
- Annotations for the Mistake Detection benchmark of Assembly101☆10Aug 3, 2023Updated 2 years ago
- Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)☆11Sep 17, 2023Updated 2 years ago
- From-Classification-to-Clinical☆12Apr 26, 2024Updated last year
- 📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer…☆10Dec 4, 2020Updated 5 years ago