jcvasquezc / AEspeech
Feature extraction from speech signals based on representation learning strategies using pre-trained autoencoders
☆19Updated last year
Alternatives and similar repositories for AEspeech
Users that are interested in AEspeech are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Spea…☆29Updated 5 years ago
- Implementation of deep recurrent nonnegative matrix factorization (DR-NMF) for speech separation☆49Updated 6 years ago
- measures to assess frequency-weighted instantaneous energy☆17Updated 3 years ago
- Applying discrete wavelet packet transform (DWPT) and nonnegative matrix factorization (NMF) analysis to speech enhancement tasks. Conven…☆11Updated 8 years ago
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow☆26Updated 6 years ago
- Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN…☆14Updated 4 years ago
- Pytorch/Python implementation of the joint CNN-LSTM deep learning model☆25Updated 3 years ago
- Speech Emotion Recognition from raw speech signals using 1D CNN-LSTM☆106Updated 3 years ago
- ☆12Updated 4 years ago
- ☆15Updated 5 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆22Updated 5 years ago
- Alzheimer's Disease Recognition Evaluation 2021☆48Updated last week
- Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)☆22Updated 7 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Updated 6 years ago
- Classification of environmental sounds using 1D convolutional Neural network☆35Updated 4 years ago
- COLA contrastive pre-training method implemented in PyTorch☆43Updated 4 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆10Updated 2 years ago
- 1D CNN based classifier for Speech Commands Dataset☆9Updated 7 years ago
- SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflow☆63Updated 4 years ago
- Reference Matlab/Octave implementations of feature extraction algorithms☆32Updated 5 years ago
- Blind Source Separation (BSS) refers to a problem where both the sources and the mixing methodology are unknown, only mixture signals are…☆11Updated 4 years ago
- Learning discriminative and robust time-frequency representations for environmental sound classification: Convolutional neural networks (…☆29Updated 5 years ago
- Code for Multi Speaker Source Separation with neural networks, build with TensorFlow☆18Updated 4 years ago
- Differentiable short-time Fourier transform (DSTFT): Gradient-based parameters tuning for adaptive time-frequency representation. DSTFT i…☆34Updated last year
- Calculate MFCC/Fbank feature for wav files☆14Updated 7 years ago
- Implementation of the multi-time-scale convolution layer used in the paper Multi-Time-Scale Convolution for Emotion Recognition from Spee…☆11Updated 5 years ago
- Audio classification via transfer learning☆33Updated 5 years ago
- Kalman filtering for speech signal enhancement☆20Updated 8 years ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆23Updated 5 years ago
- Repository for the paper "Towards duration robust weakly supervised sound event detection"☆23Updated last year