visipedia/ssw60

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/visipedia/ssw60)

visipedia / ssw60

Sapsucker Woods 60 Audiovisual Dataset

☆19

Alternatives and similar repositories for ssw60

Users that are interested in ssw60 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GeWu-Lab / MMCosine_ICASSP23
View on GitHub
The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
☆26May 18, 2023Updated 3 years ago
ethanlshen / HierNet
View on GitHub
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…
☆23Nov 8, 2023Updated 2 years ago
PRIS-CV / Making-a-Bird-AI-Expert-Work-for-You-and-Me
View on GitHub
Code release for "Making a Bird AI Expert Work for You and Me (TPAMI 2023)".
☆16May 4, 2023Updated 3 years ago
visipedia / newt
View on GitHub
Natural World Tasks
☆45Oct 2, 2023Updated 2 years ago
MehmetAygun / demistfy_correspondence
View on GitHub
Code for the ECCV22 paper Demystifying Unsupervised Semantic Correspondence Estimation
☆14Oct 18, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
samuelyu2002 / PACS
View on GitHub
Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)
☆18Dec 20, 2022Updated 3 years ago
UCSC-VLAA / Sight-Beyond-Text
View on GitHub
[TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
☆20Sep 15, 2023Updated 2 years ago
YuanGongND / uavm
View on GitHub
Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".
☆57Apr 20, 2023Updated 3 years ago
dpmlab / img2fmri
View on GitHub
A python package for predicting group-level fMRI responses to visual stimuli using deep neural networks
☆13Mar 31, 2025Updated last year
usc-sail / mica-subtitle-aligned-movie-sounds
View on GitHub
A dataset for Audio-Visual Sound Event Detection in Movies
☆26Jan 23, 2023Updated 3 years ago
macaodha / batdetect2_GUI
View on GitHub
☆14Nov 29, 2025Updated 7 months ago
ExplainableML / TCAF-GZSL
View on GitHub
This repository contains the code for our ECCV 2022 paper "Temporal and cross-modal attention for audio-visual zero-shot learning"
☆25Sep 12, 2025Updated 10 months ago
eisneim / clip-vip_video_search
View on GitHub
showing how to use CLIP-Vip to do video search
☆16Nov 16, 2023Updated 2 years ago
klauscc / VindLU
View on GitHub
☆108Dec 23, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sony / CLIPSep
View on GitHub
☆43Feb 21, 2023Updated 3 years ago
v-iashin / SparseSync
View on GitHub
Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)
☆56Jan 29, 2024Updated 2 years ago
ospanbatyr / sample-efficient-multimodality
View on GitHub
Code for the "Sample-efficient Integration of New Modalities into Large Language Models" paper
☆18Sep 8, 2025Updated 10 months ago
GenjiB / ECLIPSE
View on GitHub
☆33Mar 10, 2023Updated 3 years ago
GeWu-Lab / awesome-audiovisual-learning
View on GitHub
A curated list of audio-visual learning methods and datasets.
☆288Dec 3, 2024Updated last year
Ydkwim / CTAL
View on GitHub
Pre-training Cross-modal Transformer for Audio-and-Language Representations
☆39Apr 20, 2021Updated 5 years ago
DTaoo / Discriminative-Sounding-Objects-Localization
View on GitHub
Code for Discriminative Sounding Objects Localization (NeurIPS 2020)
☆61Jan 19, 2022Updated 4 years ago
VICO-UoE / SphericalMaps
View on GitHub
Improving Semantic Correspondences with Viewpoint-Guided Spherical Maps (CVPR 2024)
☆25Dec 4, 2024Updated last year
google / auto-arborist
View on GitHub
☆18Jun 13, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yunyikristy / global_local
View on GitHub
☆14Oct 7, 2021Updated 4 years ago
jalayrac / object-states-action
View on GitHub
Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017
☆14Aug 7, 2018Updated 7 years ago
dmoltisanti / air-cvpr23
View on GitHub
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13May 25, 2023Updated 3 years ago
soCzech / ChangeIt
View on GitHub
ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022
☆11Mar 23, 2022Updated 4 years ago
Shahzadnit / EZ-CLIP
View on GitHub
☆24May 11, 2025Updated last year
ubc-vision / TriBERT
View on GitHub
Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS…
☆14Dec 9, 2021Updated 4 years ago
ekazakos / auditory-slow-fast
View on GitHub
Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch
☆73Sep 27, 2021Updated 4 years ago
GeWu-Lab / MUSIC-AVQA
View on GitHub
MUSIC-AVQA, CVPR2022 (ORAL)
☆100Dec 30, 2022Updated 3 years ago
azuxmioy / fpvsum
View on GitHub
FPVSum : First-Person Video Summarization dataset
☆12Aug 31, 2018Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SMILE-data / SMILE
View on GitHub
SMILE: A Multimodal Dataset for Understanding Laughter
☆13Jun 15, 2023Updated 3 years ago
gary23ai / awesome_concept_learning_list
View on GitHub
A curated list of papers & resources linked to concept learning
☆13Aug 9, 2023Updated 2 years ago
vivoutlaw / tcbp
View on GitHub
Temporal Compact Bilinear Pooling (TCBP)
☆11May 27, 2020Updated 6 years ago
Attila94 / CEConv
View on GitHub
Official repository for Color Equivariant Convolutional Networks.
☆10Nov 16, 2023Updated 2 years ago
bpiyush / rotation-equivariant-lfm
View on GitHub
Rotation equivariance meets local feature matching
☆18Oct 20, 2022Updated 3 years ago
carolinahiguera / Tactile-Diffusion
View on GitHub
☆38May 9, 2023Updated 3 years ago
princetonvisualai / OverlookedFactors
View on GitHub
Overlooked Factors in Concept-based Explanations: Dataset Choice, Concept Learnability, and Human Capability (CVPR 2023)
☆10Mar 14, 2023Updated 3 years ago