liyidi / soundnet_localize_sound_sourceLinks
soundnet and localize sound source
☆11Updated 4 years ago
Alternatives and similar repositories for soundnet_localize_sound_source
Users that are interested in soundnet_localize_sound_source are comparing it to the libraries listed below
Sorting:
- Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes☆92Updated 8 months ago
- Multi-modal Speech Emotion Recogniton on IEMOCAP dataset☆89Updated 2 years ago
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020☆33Updated 4 years ago
- This repo aims to perform sound localization in complex audiovisual scenes, where there multiple objects making sounds.☆85Updated 3 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆24Updated 4 years ago
- Domestic environment sound event detection task☆144Updated last year
- Accepted by TMM 2022☆16Updated 2 years ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆210Updated 2 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆177Updated last year
- ☆33Updated 8 months ago
- Deformable Speech Transformer (DST)☆33Updated last year
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…☆130Updated 5 years ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆14Updated 5 months ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆148Updated last year
- ☆41Updated 4 years ago
- Acoustic feature extraction using Librosa library and openSMILE toolkit.使用Librosa音频处理库和openSMILE工具包,进行简单的声学特征提取☆204Updated 5 years ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆73Updated 4 years ago
- VGGSound: A Large-scale Audio-Visual Dataset☆324Updated 3 years ago
- Voice Face Association Learning Paper List☆16Updated 2 years ago
- alaaNfissi / SigWavNet-Learning-Multiresolution-Signal-Wavelet-Network-for-Speech-Emotion-RecognitionThis paper has been accepted for publication in IEEE Transactions on Affective Computing.☆17Updated 5 months ago
- ☆39Updated 2 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆151Updated 3 years ago
- Data preparation for separation☆77Updated 4 years ago
- ☆10Updated 2 years ago
- ☆13Updated last year
- ☆39Updated 8 months ago
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆233Updated last year
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆137Updated last week
- 语音增强☆17Updated 4 years ago
- implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch☆192Updated 4 years ago