liyidi / soundnet_localize_sound_source
soundnet and localize sound source
☆11Updated 4 years ago
Alternatives and similar repositories for soundnet_localize_sound_source
Users that are interested in soundnet_localize_sound_source are comparing it to the libraries listed below
- Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes☆92Updated 9 months ago
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020☆33Updated 4 years ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆15Updated 6 months ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆211Updated 2 years ago
- This repo aims to perform sound localization in complex audiovisual scenes, where there are multiple objects making sounds.☆85Updated 3 years ago
- Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge☆16Updated 2 years ago
- Multi-modal Speech Emotion Recognition on IEMOCAP dataset☆90Updated 2 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆24Updated 4 years ago
- Accepted by TMM 2022☆17Updated 3 years ago
- ☆10Updated 2 years ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…☆131Updated 5 years ago
- VGGSound: A Large-scale Audio-Visual Dataset☆327Updated 3 years ago
- ☆13Updated last year
- ☆11Updated 4 years ago
- Cross-modal background suppression for audio-visual event localization☆36Updated 3 years ago
- ☆41Updated 4 years ago
- ☆33Updated 9 months ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆73Updated 4 years ago
- Baseline method for sound event localization task of DCASE 2022 challenge☆55Updated 3 years ago
- Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018☆190Updated 4 years ago
- Domestic environment sound event detection task☆146Updated last year
- ☆37Updated last year
- ☆40Updated 2 years ago
- ☆39Updated 9 months ago
- Conformer: Convolution-augmented Transformer for Speech Recognition☆10Updated 3 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆140Updated last week
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆151Updated last year
- Speech enhancement☆17Updated 4 years ago
- Noise15, Noisex-92 and Nonspeech☆45Updated 4 years ago
- Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge☆53Updated 5 months ago