jcvasquezc / AEspeechLinks
Feature extraction from speech signals based on representation learning strategies using pre-trained autoencoders
☆19Updated last year
Alternatives and similar repositories for AEspeech
Users that are interested in AEspeech are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Spea…☆29Updated 5 years ago
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow☆26Updated 6 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆22Updated 5 years ago
- Implementation of deep recurrent nonnegative matrix factorization (DR-NMF) for speech separation☆49Updated 6 years ago
- Applying discrete wavelet packet transform (DWPT) and nonnegative matrix factorization (NMF) analysis to speech enhancement tasks. Conven…☆11Updated 8 years ago
- The code for DCASE2021 task5 submission.☆20Updated 3 years ago
- Learning discriminative and robust time-frequency representations for environmental sound classification: Convolutional neural networks (…☆30Updated 5 years ago
- Pytorch implementation of [Learning to match transient sound events using attentional similarity for few-shot sound recognition]☆33Updated 6 years ago
- ☆15Updated 5 years ago
- Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN…☆14Updated 4 years ago
- 1D CNN based classifier for Speech Commands Dataset☆9Updated 7 years ago
- Differentiable short-time Fourier transform (DSTFT): Gradient-based parameters tuning for adaptive time-frequency representation. DSTFT i…☆35Updated last year
- Matlab tools for pathological voice analysis☆13Updated 2 years ago
- Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)☆22Updated 7 years ago
- SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflow☆64Updated 4 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆64Updated 9 months ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Updated 6 years ago
- PyTorch Implementation of SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, a…☆21Updated 6 years ago
- Code for Multi Speaker Source Separation with neural networks, build with TensorFlow☆18Updated 4 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆92Updated 2 years ago
- This thesis applies an autoencoder deep neural network to the multichannel speech enhancement problem. It takes the problem from dataset …☆11Updated 2 years ago
- Generative Adversarial Network implemented for the Time-Frequency based Speech Enhancement☆9Updated last year
- Audio MNIST Classification using 1D-CNN, 2D-CNN, GAN+2D-CNN, CVN+RandomForest, and LSTMs.☆14Updated 3 years ago
- COLA contrastive pre-training method implemented in PyTorch☆43Updated 4 years ago
- measures to assess frequency-weighted instantaneous energy☆17Updated 3 years ago
- Calculate MFCC/Fbank feature for wav files☆14Updated 7 years ago
- ☆18Updated 4 years ago
- ☆12Updated 2 years ago
- ☆12Updated 4 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 3 years ago