Temporal Compact Bilinear Pooling (TCBP)
☆11May 27, 2020Updated 5 years ago
Alternatives and similar repositories for tcbp
Users that are interested in tcbp are comparing it to the libraries listed below.
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆23May 17, 2021Updated 4 years ago
- ☆13May 10, 2025Updated 10 months ago
- [arXiv 2020] Video Representation Learning with Visual Tempo Consistency☆24Jun 30, 2020Updated 5 years ago
- A dataset for Audio-Visual Sound Event Detection in Movies☆26Jan 23, 2023Updated 3 years ago
- Just a data☆11Oct 20, 2025Updated 5 months ago
- ☆11Dec 23, 2018Updated 7 years ago
- Official code of *Towards Event-oriented Long Video Understanding*☆12Jul 26, 2024Updated last year
- ☆31Jun 18, 2021Updated 4 years ago
- ComputeR vIsion for Sport Performance☆11May 14, 2024Updated last year
- Python program to generate, draw, and analyze spectral networks of class S theories☆12Mar 13, 2020Updated 6 years ago
- ☆12Jun 9, 2018Updated 7 years ago
- ☆10May 24, 2023Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017☆14Aug 7, 2018Updated 7 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Jul 16, 2025Updated 8 months ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- Latex template for Oxford integrated thesis☆19Apr 7, 2025Updated 11 months ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 4 years ago
- Evaluation script for VoxMovies dataset in PyTorch☆23Jan 12, 2024Updated 2 years ago
- Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch☆73Sep 27, 2021Updated 4 years ago
- Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation[TNNLS2024]☆13May 6, 2025Updated 10 months ago
- Reading list for multimodal sequence learning☆14Sep 4, 2023Updated 2 years ago
- ☆17Sep 25, 2023Updated 2 years ago
- ☆41May 7, 2022Updated 3 years ago
- Labeled Movie Trailer Dataset☆16Mar 23, 2018Updated 8 years ago
- ☆53Oct 16, 2023Updated 2 years ago
- Code for "Distributed, Egocentric Representations of Graphs for Detecting Critical Structures" (ICML 2019)☆20Aug 24, 2021Updated 4 years ago
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Jun 15, 2023Updated 2 years ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆20May 27, 2025Updated 10 months ago
- tf&torch about nlp☆11Aug 12, 2022Updated 3 years ago
- ☆13Jul 20, 2024Updated last year
- Rotation equivariance meets local feature matching☆18Oct 20, 2022Updated 3 years ago
- ☆19Jan 30, 2023Updated 3 years ago
- Tools to distill the Hiera transformer backbone to CNNs that are easier to deploy on the edge.☆15Dec 4, 2024Updated last year
- YOLO-World-ONNX is a Python package for running inference on YOLO-WORLD Open-vocabulary-object detection model using ONNX models. It prov…☆16Feb 6, 2026Updated last month
- EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"☆12Nov 7, 2023Updated 2 years ago
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆17Dec 20, 2022Updated 3 years ago
- Includes the code for training and testing the CountGD++ model from the paper CountGD++: Generalized Prompting for Open-World Counting.☆32Feb 25, 2026Updated last month