zjsong/SSPL

PyTorch code for "Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes" (CVPR, 2022)

☆32

Alternatives and similar repositories for SSPL

Users who are interested in SSPL are comparing it to the repositories listed below.

hxixixh / mix-and-localize
☆23 · Mar 20, 2024 · Updated 2 years ago
stoneMo / EZ-VSL
Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)
☆42 · Oct 2, 2022 · Updated 3 years ago
hche11 / Localizing-Visual-Sounds-the-Hard-Way
Localizing Visual Sounds the Hard Way
☆84 · Jul 6, 2022 · Updated 4 years ago
marmot-xy / CMBS
Cross-modal background suppression for audio-visual event localization
☆36 · Mar 18, 2022 · Updated 4 years ago
OpenNLPLab / FNAC_AVL
[CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…
☆29 · Apr 10, 2023 · Updated 3 years ago
RickyMexx / 3D-Sound-Localization
Quaternion Neural Networks for 3D Sound Source Localization in Reverberant Environments.
☆19 · Nov 21, 2022 · Updated 3 years ago
kaistmm / SSLalignment
☆37 · May 28, 2025 · Updated last year
sony / audio-visual-seld-dcase2023
Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge
☆68 · Mar 19, 2025 · Updated last year
shvdiwnkozbw / Multi-Source-Sound-Localization
This repo aims to perform sound localization in complex audiovisual scenes, where there are multiple objects making sounds.
☆96 · Oct 18, 2021 · Updated 4 years ago
hche11 / VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
☆359 · Sep 13, 2021 · Updated 4 years ago
hirokiyokoyama / sound_source_localization
(planned to) make dataset for sound source localization with two robots that have microphone arrays and speaker, and train CNN-based loca…
☆12 · Nov 30, 2018 · Updated 7 years ago
alvinliu0 / Visual-Sound-Localization-in-the-Wild
Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).
☆29 · Feb 15, 2022 · Updated 4 years ago
VisualAIKHU / SIRA-SSL
Official Repository for "Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization" (ACM MM 2023)
☆18 · Nov 14, 2023 · Updated 2 years ago
catherine-qian / cocosda-SSL
PyTorch code for sound event localization and classification
☆13 · Aug 12, 2021 · Updated 4 years ago
YapengTian / AVVP-ECCV20
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)
☆90 · Jul 25, 2024 · Updated last year
adrianSRoman / DeepWaveTorch
DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging (PyTorch implementation)
☆23 · Jul 4, 2024 · Updated 2 years ago
BingYang-20 / SRP-DNN
A Python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
☆66 · Sep 28, 2024 · Updated last year
yunyikristy / CM-ACC
Cross-model active contrastive coding
☆22 · Mar 17, 2021 · Updated 5 years ago
afrancl / BinauralLocalizationCNN
Code to create networks that localize sound sources in 3D environments
☆53 · Jan 27, 2024 · Updated 2 years ago
aispeech-lab / advr-avss
PyTorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18 · Jul 11, 2022 · Updated 4 years ago
denfed / heartheflow
Repository for the 2023 WACV paper: "Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization"
☆12 · Dec 21, 2022 · Updated 3 years ago
rhgao / co-separation
Co-Separating Sounds of Visual Objects (ICCV 2019)
☆98 · Jul 25, 2023 · Updated 2 years ago
karreny / telling-left-from-right
Project website for "Telling left from right: Learning spatial correspondence between sight and sound"
☆29 · Jun 6, 2022 · Updated 4 years ago
danielkrause / DCASE2022-data-generator
Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3
☆47 · Apr 5, 2023 · Updated 3 years ago
sherwinbahmani / threed_front_rendering
☆13 · Sep 2, 2023 · Updated 2 years ago
cxy1997 / Sentiment-Analysis-with-RNN-and-CNN
Project of SJTU-CS438 Internet-based Information Extraction Technologies
☆11 · Oct 19, 2018 · Updated 7 years ago
Honee-W / CPTNN
Unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"
☆15 · Nov 14, 2023 · Updated 2 years ago
hekj / Landmark-RxR
A human-annotated, fine-grained dataset for Vision-and-Language Navigation
☆17 · Jan 20, 2022 · Updated 4 years ago
FloretCat / CMRAN
Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020
☆33 · Nov 6, 2020 · Updated 5 years ago
GenjiB / LAVISH
Vision Transformers are Parameter-Efficient Audio-Visual Learners
☆107 · Aug 11, 2023 · Updated 2 years ago
jinxiang-liu / anno-free-AVS
Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"
☆38 · Oct 11, 2024 · Updated last year
thomeou / SALSA
This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.
☆114 · May 31, 2022 · Updated 4 years ago
yannqi / COMBO-AVS
[CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…
☆40 · Apr 20, 2025 · Updated last year
SolomidHero / speech-regeneration-enhancer
PyTorch implementation of the paper "High Fidelity Speech Regeneration With Application to Speech Enhancement"
☆15 · May 8, 2021 · Updated 5 years ago
GeWu-Lab / awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.
☆288 · Dec 3, 2024 · Updated last year
FingerRec / Self-Supervised-Temporal-Discriminative-Representation-Learning-for-Video-Action-Recognition
[Arxiv2020] The code for our paper 《Self-Supervised Temporal-Discriminative Representation Learning for Video Action Recognition》 https:/…
☆76 · Sep 19, 2020 · Updated 5 years ago
yanbeic / CCL
PyTorch Implementation on Paper [CVPR2021] Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
☆88 · Jul 7, 2021 · Updated 5 years ago
Linya-lab / Video_Decaptioning
☆13 · Feb 19, 2022 · Updated 4 years ago
Jinbo-Hu / L3DAS22-TASK2
A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection
☆23 · Nov 14, 2024 · Updated last year