petewarden / extract_loudest_sectionLinks
Trims .wav audio files to the loudest section of a given length
☆98Updated 7 years ago
Alternatives and similar repositories for extract_loudest_section
Users that are interested in extract_loudest_section are comparing it to the libraries listed below
Sorting:
- Speech recognition software where the neural net is trained with TensorFlow and GMM training and decoding is done in Kaldi☆173Updated 8 years ago
- Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).☆130Updated 4 years ago
- Keras Interface for Kaldi ASR☆122Updated 8 years ago
- HTK features in Python☆73Updated last week
- Implementation of all-neural speech recognition systems using Keras and Tensorflow☆145Updated 8 years ago
- Simple Speech Keyword Detecting with Depthwise Separable Convolutions | DLology☆42Updated 7 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆110Updated 6 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 8 years ago
- Voice Activity Detection system (Matlab-based implementation)☆108Updated 8 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 8 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆380Updated 2 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- End-to-end speech recognition using TensorFlow☆50Updated 7 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Updated 8 years ago
- PyTorch implementations of neural network models for keyword spotting☆518Updated 2 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- A high-level toolkit for speaker recognition, build on top of ALIZE-Core.☆125Updated 6 years ago
- Audio Classifier in Keras using Convolutional Neural Network☆160Updated 6 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆310Updated 6 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆75Updated 4 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Updated 8 years ago
- An opensource speech-to-text software written in tensorflow☆160Updated 3 years ago
- ☆26Updated 8 years ago
- DCASE 2017 Baseline system☆82Updated 5 years ago
- Spectrograms, MFCCs, and Inversion Demo in a jupyter notebook☆169Updated 6 years ago
- Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.☆29Updated 7 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆86Updated 6 years ago
- A neural attention model for speech command recognition☆187Updated 4 months ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- ASR with PyTorch☆140Updated 6 years ago