guglielmocamporese / learning_invariances_in_speech_recognitionLinks
In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
☆19Updated 7 years ago
Alternatives and similar repositories for learning_invariances_in_speech_recognition
Users that are interested in learning_invariances_in_speech_recognition are comparing it to the libraries listed below
Sorting:
- Deep Neural Network for Speaker Count Estimation☆156Updated 5 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 6 years ago
- PyTorch reimplementation of per-channel energy normalization for audio.☆102Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆96Updated 5 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 6 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 5 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- fast SpecAugmentation code with numpy and scipy☆31Updated 6 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆141Updated 2 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 6 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- Voxceleb1 i-vector based speaker recognition system☆44Updated 7 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 7 years ago
- An open-source speech separation and enhancement library☆213Updated 5 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Updated 3 years ago
- Voice Activity Detection (VAD) using deep learning.☆201Updated 6 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 6 years ago
- Share some recent speaker recognition papers and their implementations.☆90Updated 6 years ago
- A implementation of Power Normalized Cepstral Coefficients: PNCC☆54Updated 6 years ago
- Python toolkit for speech processing☆72Updated last week
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Updated 5 years ago
- Probabilistic Linear Discriminant Analysis & classification, written in Python.☆129Updated 3 years ago
- Benchmark for sound event localization task of DCASE 2019 challenge☆78Updated 5 years ago