guglielmocamporese / learning_invariances_in_speech_recognitionLinks
In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
☆19Updated 7 years ago
Alternatives and similar repositories for learning_invariances_in_speech_recognition
Users that are interested in learning_invariances_in_speech_recognition are comparing it to the libraries listed below
Sorting:
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 4 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 5 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- ☆24Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated last year
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆64Updated 5 years ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆89Updated 3 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- Deep Neural Network for Speaker Count Estimation☆155Updated 5 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated 2 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- PyTorch reimplementation of per-channel energy normalization for audio.☆101Updated 6 years ago
- Python toolkit for speech processing☆71Updated 3 weeks ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆35Updated 7 years ago
- Spectra extraction tutorials based on torch and torchaudio.☆41Updated 2 years ago
- fast SpecAugmentation code with numpy and scipy☆31Updated 6 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- A implementation of Power Normalized Cepstral Coefficients: PNCC☆53Updated 6 years ago
- A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an …☆71Updated 3 weeks ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆55Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- Making Espnet easier to use☆56Updated 4 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Updated 6 years ago
- An advance kaldi wrapper for Pyhton☆38Updated 4 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago