guglielmocamporese / learning_invariances_in_speech_recognition
In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
☆19Updated 6 years ago
Alternatives and similar repositories for learning_invariances_in_speech_recognition:
Users that are interested in learning_invariances_in_speech_recognition are comparing it to the libraries listed below
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- Keras-based python framework to compute phonological posterior probabilities from audio files☆38Updated 2 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 5 years ago
- ☆24Updated 6 years ago
- ☆56Updated 3 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆59Updated 4 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 2 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- A implementation of Power Normalized Cepstral Coefficients: PNCC☆50Updated 5 years ago
- Python toolkit for speech processing☆68Updated last week
- Articulatory features estimation using Listen Attend and Spell architecture.☆32Updated 4 years ago
- ☆17Updated 5 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆101Updated 5 years ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- Paper: https://arxiv.org/abs/1702.02285☆63Updated 6 years ago
- Spectra extraction tutorials based on torch and torchaudio.☆41Updated last year
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 4 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- ☆59Updated 4 years ago
- Non-Parallel Voice Conversion with Cyclic Variational Autoencoder☆52Updated 4 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Updated 5 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 5 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆45Updated 4 years ago
- Feature extractor for DL speech processing.☆65Updated 2 years ago