guglielmocamporese / learning_invariances_in_speech_recognition
In this work I investigate the speech command task by developing and analyzing deep learning models. State-of-the-art systems use convolutional neural networks (CNNs) because they naturally learn correlated representations, which suits speech signals. In particular, I develop different CNNs trained on the Google Speech Command Dataset…
☆19Updated 7 years ago
Alternatives and similar repositories for learning_invariances_in_speech_recognition
Users who are interested in learning_invariances_in_speech_recognition are comparing it to the libraries listed below.
- PyTorch reimplementation of per-channel energy normalization for audio.☆101Updated 6 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 5 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 5 years ago
- fast SpecAugmentation code with numpy and scipy☆31Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆35Updated 7 years ago
- A PyTorch implementation of DeepSpeech and DeepSpeech2.☆50Updated 6 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- ☆24Updated 7 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 7 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆141Updated 2 years ago
- Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf☆64Updated 2 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- Speech recognition with CTC in Keras with Tensorflow backend☆31Updated 2 years ago
- An advanced Kaldi wrapper for Python☆38Updated 4 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 6 years ago
- HTK features in Python☆73Updated 6 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- Spectra extraction tutorials based on torch and torchaudio.☆41Updated 2 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆42Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 3 years ago
- ABX and kaldi experiments on speech corpora made easy☆33Updated last year
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆74Updated 4 years ago
- A Python toolbox for speech features extraction☆165Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago