Video Feature Extractor for S3D-HowTo100M
☆29Apr 30, 2021Updated 5 years ago
Alternatives and similar repositories for VideoFeatureExtractor
Users that are interested in VideoFeatureExtractor are comparing it to the libraries listed below.
- ☆15May 23, 2023Updated 3 years ago
- An official implementation for "UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"☆365Jul 25, 2024Updated last year
- Easy to use video deep features extractor☆322Jul 5, 2020Updated 5 years ago
- Repository for Multimodal AutoML Benchmark☆67Dec 7, 2021Updated 4 years ago
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Apr 6, 2021Updated 5 years ago
- ☆23Dec 16, 2022Updated 3 years ago
- Animals3D: Learning Articulated Shape with Keypoint Pseudo-labels from Web Images (CVPR 2023)☆14May 20, 2024Updated 2 years ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Dec 7, 2022Updated 3 years ago
- Code for the HowTo100M paper☆303Mar 10, 2020Updated 6 years ago
- ☆25Mar 4, 2022Updated 4 years ago
- The implementation of our CIKM 2021 paper titled "Cross-Market Product Recommendation"☆20Nov 30, 2021Updated 4 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆34Feb 5, 2023Updated 3 years ago
- Cross-model active contrastive coding☆22Mar 17, 2021Updated 5 years ago
- Python implementation for extracting several visual feature representations from videos☆23Jul 19, 2021Updated 4 years ago
- Huawei DIGIX 2021 Challenge, Task 1☆29Nov 10, 2021Updated 4 years ago
- ☆21Feb 18, 2022Updated 4 years ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.☆44Apr 17, 2023Updated 3 years ago
- ☆22Jun 6, 2020Updated 6 years ago
- Text-Image Relationships (ACL 2019)☆23Sep 15, 2023Updated 2 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- ☆31Mar 2, 2023Updated 3 years ago
- ☆39Sep 23, 2021Updated 4 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 3 years ago
- [TMLR] Unsupervised Network Embedding Beyond Homophily (https://arxiv.org/abs/2203.10866) Resources☆11Mar 21, 2023Updated 3 years ago
- ☆10Oct 7, 2023Updated 2 years ago
- ☆61Jun 16, 2023Updated 3 years ago
- ☆24Jan 6, 2020Updated 6 years ago
- ☆16Aug 28, 2024Updated last year
- Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)☆11Sep 17, 2023Updated 2 years ago
- Data for evaluating GPT-4V☆11Oct 26, 2023Updated 2 years ago
- Open-source code of the TOP2 solution from the first E-commerce AI Algorithm Competition☆13Aug 31, 2021Updated 4 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 3 years ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…☆12Sep 16, 2022Updated 3 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆26Jul 16, 2025Updated 11 months ago
- ☆13Jun 26, 2022Updated 4 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 3 years ago
- Annotations for the Mistake Detection benchmark of Assembly101☆12Aug 3, 2023Updated 2 years ago
- Turning to Video for Transcript Sorting☆50Aug 27, 2023Updated 2 years ago