webstah / self-supervised-bss-via-multi-encoder-ae
Official repository for "Blind Source Separation of Single-Channel Mixtures via Multi-Encoder Autoencoders".
☆15 Updated this week
Alternatives and similar repositories for self-supervised-bss-via-multi-encoder-ae
Users who are interested in self-supervised-bss-via-multi-encoder-ae are comparing it to the repositories listed below
- To access the dataset, please send a short bio and the objective of your study to Dr. Theerawit Wilaiprasitporn (theerawit dot w at v… ☆14 Updated 4 years ago
- Differentiable short-time Fourier transform (DSTFT): Gradient-based parameters tuning for adaptive time-frequency representation. DSTFT i… ☆44 Updated 2 weeks ago
- ICSD Dataset ☆30 Updated last month
- MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection ☆9 Updated 9 months ago
- ☆14 Updated 11 months ago
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022) ☆13 Updated 3 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor… ☆22 Updated 2 years ago
- Source code for AAAI 22 paper: Hybrid Neural Networks for On-Device Directional Hearing ☆17 Updated last year
- Implementation of the paper: StyleBERT: Text-Audio Sentiment Analysis with Bi-directional Style Enhancement ☆13 Updated 2 years ago
- DBPNet model ☆41 Updated 7 months ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings ☆15 Updated last month
- Implementation for "SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification," in pytorch. ☆24 Updated last year
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022. ☆15 Updated 2 years ago
- Unofficial implementation of FSD50k baselines for Sound Event Recognition ☆26 Updated last year
- code and speech demo for speech reconstruction from ECoG recordings ☆13 Updated last month
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle. ☆12 Updated last year
- ☆11 Updated 2 months ago
- Official implementation of EfficientLEAF, a learnable audio frontend. ☆45 Updated 2 years ago
- Repository of published DNN speech separation recipes for a number of datasets ☆12 Updated last year
- A small tool to calculate the distribution of audio durations in a directory ☆14 Updated 2 years ago
- ☆13 Updated 9 months ago
- Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT s… ☆56 Updated last year
- This paper has been accepted in ACM ICMR 2021. ☆20 Updated 3 years ago
- ☆16 Updated last month
- Feature extraction from speech signals based on representation learning strategies using pre-trained autoencoders ☆19 Updated 2 years ago
- Independent Vector Analysis (IVA-G and IVA-L-SOS) implemented in Python ☆19 Updated 3 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆37 Updated last year
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition" ☆10 Updated 2 years ago
- ☆17 Updated last year
- A Vital Signal Analysis Package ☆22 Updated last year