liyidi / soundnet_localize_sound_sourceLinks
soundnet and localize sound source
☆11Updated 4 years ago
Alternatives and similar repositories for soundnet_localize_sound_source
Users that are interested in soundnet_localize_sound_source are comparing it to the libraries listed below
Sorting:
- Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes☆92Updated 6 months ago
- ☆31Updated 7 months ago
- This repo aims to perform sound localization in complex audiovisual scenes, where there multiple objects making sounds.☆85Updated 3 years ago
- Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge☆52Updated 3 months ago
- Data preparation for separation☆77Updated 4 years ago
- Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge☆16Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2022 challenge☆55Updated 3 years ago
- ☆13Updated 11 months ago
- Domestic environment sound event detection task☆145Updated last year
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆73Updated 3 years ago
- A LSTM for voice activity detection. In fact, this is a homework which I didn't expected.☆13Updated 4 years ago
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020☆33Updated 4 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆133Updated 3 weeks ago
- CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)☆23Updated last month
- This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.☆101Updated 3 years ago
- Accepted by TMM 2022☆16Updated 2 years ago
- ☆37Updated last year
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆14Updated 4 months ago
- Baseline method for sound event localization task of DCASE 2023 challenge☆53Updated 2 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆44Updated 3 years ago
- ☆39Updated 2 years ago
- MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection☆13Updated 11 months ago
- ☆65Updated 9 months ago
- ☆53Updated 2 months ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python.☆60Updated 4 years ago
- Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test.☆87Updated 6 years ago
- ☆19Updated 3 months ago
- Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3☆38Updated 2 years ago
- MetricGAN+ PyTorch Implementation☆24Updated last year
- ☆145Updated last year