liyidi / soundnet_localize_sound_source
soundnet and localize sound source
☆11Updated 4 years ago
Alternatives and similar repositories for soundnet_localize_sound_source:
Users that are interested in soundnet_localize_sound_source are comparing it to the libraries listed below
- Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes☆90Updated 4 months ago
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020☆33Updated 4 years ago
- Accepted by TMM 2022☆16Updated 2 years ago
- ☆41Updated 4 years ago
- Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge☆16Updated 2 years ago
- ☆31Updated 5 months ago
- cross modal background suppression for audio-visual event localization☆35Updated 3 years ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆71Updated 3 years ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆207Updated 2 years ago
- Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge☆51Updated last month
- This repo aims to perform sound localization in complex audiovisual scenes, where there multiple objects making sounds.☆83Updated 3 years ago
- A LSTM for voice activity detection. In fact, this is a homework which I didn't expected.☆13Updated 4 years ago
- ☆37Updated 10 months ago
- ☆33Updated 5 months ago
- Data preparation for separation☆76Updated 4 years ago
- Baseline method for sound event localization task of DCASE 2023 challenge☆50Updated 2 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆128Updated 6 months ago
- Baseline method for sound event localization task of DCASE 2022 challenge☆55Updated 2 years ago
- Domestic environment sound event detection task☆142Updated 10 months ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆12Updated 2 months ago
- Voice Face Association Learning Paper List☆15Updated last year
- Multi-modal Speech Emotion Recogniton on IEMOCAP dataset☆89Updated last year
- ☆13Updated 9 months ago
- Deformable Speech Transformer (DST)☆31Updated 8 months ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆38Updated 9 months ago
- MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection☆11Updated 9 months ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…☆129Updated 4 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆149Updated 3 years ago
- ☆107Updated 2 years ago
- Toolkit for downloading and processing Google's AudioSet dataset.☆169Updated last year