facebookresearch/Listen-to-Look

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/Listen-to-Look)

facebookresearch / Listen-to-Look

Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)

☆130

Alternatives and similar repositories for Listen-to-Look

Users that are interested in Listen-to-Look are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / 2.5D-Visual-Sound
View on GitHub
2.5D visual sound
☆121Jul 25, 2023Updated 3 years ago
ekazakos / temporal-binding-network
View on GitHub
Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch
☆112Jan 25, 2021Updated 5 years ago
rhgao / co-separation
View on GitHub
Co-Separating Sounds of Visual Objects (ICCV 2019)
☆98Jul 25, 2023Updated 3 years ago
mengyuest / AR-Net
View on GitHub
[ECCV2020] Learn optimal resolution and skipping mechanism for efficient video understanding
☆63Aug 17, 2020Updated 5 years ago
kennymckormick / ARAS-Dataset
View on GitHub
☆11Nov 5, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
YapengTian / AVVP-ECCV20
View on GitHub
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)
☆90Jul 25, 2024Updated 2 years ago
YapengTian / AVE-ECCV18
View on GitHub
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
☆210Apr 3, 2021Updated 5 years ago
tobyperrett / few-shot-action-recognition
View on GitHub
Implementations of some few-shot action recognition methods.
☆43Jun 7, 2021Updated 5 years ago
rhgao / Deep-MIML-Network
View on GitHub
Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)
☆50Sep 24, 2019Updated 6 years ago
zbwglory / CMHSE
View on GitHub
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆20Apr 26, 2020Updated 6 years ago
decisionforce / TPN
View on GitHub
[CVPR 2020] Temporal Pyramid Network for Action Recognition
☆394Jan 12, 2021Updated 5 years ago
facebookresearch / FAIR-Play
View on GitHub
2.5D visual sound dataset
☆108Sep 21, 2021Updated 4 years ago
yanbeic / CCL
View on GitHub
PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
☆88Jul 7, 2021Updated 5 years ago
epic-kitchens / epic-kitchens-55-action-models
View on GitHub
EPIC-KITCHENS-55 baselines for Action Recognition
☆75Jul 14, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pujols / Video-summarization
View on GitHub
☆18Jan 29, 2020Updated 6 years ago
neuroailab / VIE
View on GitHub
Codes for "Unsupervised Learning from Video with Deep Neural Embeddings"
☆82Jun 7, 2021Updated 5 years ago
stoneMo / MGN
View on GitHub
Official implementation for MGN
☆20Dec 22, 2022Updated 3 years ago
GeWu-Lab / MUSIC-AVQA
View on GitHub
MUSIC-AVQA, CVPR2022 (ORAL)
☆100Dec 30, 2022Updated 3 years ago
fanyix / SlowFast
View on GitHub
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆14Aug 11, 2020Updated 5 years ago
mengyuest / AdaFuse
View on GitHub
[ICLR2021] AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition
☆35Apr 8, 2021Updated 5 years ago
cshizhe / hgr_v2t
View on GitHub
Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".
☆211Jun 12, 2020Updated 6 years ago
swathikirans / GSM
View on GitHub
Gate-Shift Networks for Video Action Recognition - CVPR 2020
☆149Jun 19, 2020Updated 6 years ago
amazon-science / video-contrastive-learning
View on GitHub
Video Contrastive Learning with Global Context, ICCVW 2021
☆162May 30, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
JuanFMontesinos / Solos
View on GitHub
Solos: A Dataset for Audio-Visual Music Analysis
☆24Feb 17, 2023Updated 3 years ago
torchexpo / torchexpo
View on GitHub
Collection of models and extensions for deployment in PyTorch
☆24Nov 20, 2022Updated 3 years ago
v-iashin / BMT
View on GitHub
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
☆231Apr 8, 2023Updated 3 years ago
kdexd / virtex
View on GitHub
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations
☆561Aug 22, 2025Updated 11 months ago
niluthpol / weak_supervised_video_moment
View on GitHub
Weakly Supervised Video Moment Retrieval from Text Queries
☆43Jul 20, 2020Updated 6 years ago
naver-ai / lut
View on GitHub
[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"
☆14Dec 1, 2024Updated last year
facebookresearch / ActivityNet-Entities
View on GitHub
A Dataset for Grounded Video Description
☆165Jan 4, 2022Updated 4 years ago
JaywongWang / CBP
View on GitHub
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆59Mar 24, 2023Updated 3 years ago
Qualcomm-AI-research / FrameExit
View on GitHub
☆37Jul 8, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
laura-wang / video-pace
View on GitHub
code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction
☆100May 13, 2021Updated 5 years ago
MichiganCOG / A2CL-PT
View on GitHub
Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization (ECCV 2020)
☆47Jul 20, 2023Updated 3 years ago
kkahatapitiya / X3D-Multigrid
View on GitHub
PyTorch implementation of X3D models with Multigrid training.
☆103Oct 10, 2021Updated 4 years ago
fpv-iplab / rulstm
View on GitHub
Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Un…
☆137Aug 23, 2023Updated 2 years ago
facebookresearch / VMZ
View on GitHub
VMZ: Model Zoo for Video Modeling
☆1,054Jun 17, 2025Updated last year
facebookresearch / video-long-term-feature-banks
View on GitHub
Long-Term Feature Banks for Detailed Video Understanding
☆383Aug 30, 2021Updated 4 years ago
facebookresearch / SlowFast
View on GitHub
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,394Mar 16, 2026Updated 4 months ago