The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding
☆64Mar 9, 2022Updated 3 years ago
Alternatives and similar repositories for Temporal_Query_Networks
Users that are interested in Temporal_Query_Networks are comparing it to the libraries listed below
Sorting:
- Placeholder for code of BSP.☆11Aug 13, 2021Updated 4 years ago
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.☆119Oct 9, 2023Updated 2 years ago
- Code for our paper "HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition".☆14Jan 3, 2023Updated 3 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021☆162May 30, 2022Updated 3 years ago
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆23May 17, 2021Updated 4 years ago
- [CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos☆101Oct 30, 2022Updated 3 years ago
- ☆36Apr 14, 2021Updated 4 years ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆380May 19, 2022Updated 3 years ago
- implementation of "Action Quality Assessment with Temporal Parsing Transformer"☆24Aug 2, 2022Updated 3 years ago
- ☆27Jul 18, 2025Updated 7 months ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆116Sep 15, 2022Updated 3 years ago
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding☆91Nov 16, 2022Updated 3 years ago
- source code of our RaNet in EMNLP 2021☆30May 31, 2022Updated 3 years ago
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Jul 29, 2022Updated 3 years ago
- Code for our CVPR 2022 Paper "Hybrid Relation Guided Set Matching for Few-shot Action Recognition".☆27Jan 3, 2023Updated 3 years ago
- [CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》☆62May 25, 2022Updated 3 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆100May 13, 2021Updated 4 years ago
- ☆43Mar 8, 2021Updated 4 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts☆188May 1, 2025Updated 9 months ago
- RareAct: A video dataset of unusual interactions☆33Aug 4, 2020Updated 5 years ago
- Code for Learning to Learn Language from Narrated Video☆33Oct 3, 2023Updated 2 years ago
- V4D: 4D Convolutional Neural Networks for Video-level Representation Learning☆70Oct 22, 2020Updated 5 years ago
- [ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval☆13Nov 5, 2023Updated 2 years ago
- Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"☆131Jul 5, 2021Updated 4 years ago
- ☆87Mar 4, 2024Updated last year
- Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022☆39Feb 17, 2023Updated 3 years ago
- [ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos☆127Sep 29, 2023Updated 2 years ago
- ☆73Jun 3, 2022Updated 3 years ago
- [BMVC 2021] A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark☆43Dec 2, 2021Updated 4 years ago
- Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition☆14Dec 22, 2022Updated 3 years ago
- ☆12Mar 12, 2023Updated 2 years ago
- This is the pytorch implementation of some representative action recognition approaches including I3D, S3D, TSN and TAM.☆257Oct 8, 2021Updated 4 years ago
- [NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models☆158Dec 9, 2024Updated last year
- S3D Text-Video model trained on HowTo100M using MIL-NCE☆200Jul 3, 2020Updated 5 years ago
- Code for our CVPR 2023 paper "MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition".☆50Mar 7, 2024Updated last year
- Dense Regression Network for Video Grounding (CVPR2020)☆53Jan 28, 2021Updated 5 years ago
- Preprocess the activityNet dataset for detection task☆13Mar 3, 2017Updated 8 years ago
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago