guglielmocamporese / learning_invariances_in_speech_recognitionLinks
In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
☆19Updated 7 years ago
Alternatives and similar repositories for learning_invariances_in_speech_recognition
Users that are interested in learning_invariances_in_speech_recognition are comparing it to the libraries listed below
Sorting:
- ☆24Updated 6 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages☆9Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆44Updated 2 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆30Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated last year
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 4 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated 2 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Spectra extraction tutorials based on torch and torchaudio.☆41Updated last year
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆30Updated last year
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆21Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 11 months ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆40Updated 4 years ago
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 9 months ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆91Updated 4 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 5 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- An advance kaldi wrapper for Pyhton☆38Updated 4 years ago
- ☆37Updated 2 months ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- A PyTorch 1.0 implementation of the convolutions described in SincNet☆33Updated 6 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- An implementation of RNN-Transducer loss in TF-2.0.☆45Updated 2 years ago