liyidi / soundnet_localize_sound_source
soundnet and localize sound source
☆12Updated 5 years ago
Alternatives and similar repositories for soundnet_localize_sound_source
Users who are interested in soundnet_localize_sound_source are comparing it to the libraries listed below.
- Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes☆95Updated last year
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆219Updated 2 years ago
- Multi-modal Speech Emotion Recognition on IEMOCAP dataset☆93Updated 2 years ago
- Data preparation for separation☆78Updated 4 years ago
- Accepted by TMM 2022☆18Updated 3 years ago
- This repo aims to perform sound localization in complex audiovisual scenes, where there are multiple objects making sounds.☆92Updated 4 years ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆16Updated 9 months ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆153Updated 3 months ago
- MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection☆20Updated last year
- ☆34Updated last year
- ☆14Updated last year
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020☆32Updated 5 years ago
- Domestic environment sound event detection task☆152Updated last year
- ☆13Updated 4 years ago
- Deformable Speech Transformer (DST)☆35Updated last year
- VGGSound: A Large-scale Audio-Visual Dataset☆346Updated 4 years ago
- ☆41Updated 3 years ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆74Updated 4 years ago
- This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.☆103Updated 3 years ago
- Noise15, Noisex-92 and Nonspeech☆47Updated 5 years ago
- An LSTM for voice activity detection. In fact, this is a homework which I didn't expect.☆13Updated 5 years ago
- ☆16Updated 6 months ago
- ☆42Updated 5 years ago
- ☆39Updated this week
- Implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" in PyTorch☆203Updated 5 years ago
- Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge☆16Updated 2 years ago
- ☆39Updated last year
- MetricGAN+ PyTorch Implementation☆28Updated last year
- An unofficial implementation of the paper "Objects that Sound" from ECCV 2018.☆31Updated last year
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆237Updated last year