petewarden / extract_loudest_section
Trims .wav audio files to the loudest section of a given length
☆95Updated 7 years ago
Alternatives and similar repositories for extract_loudest_section:
Users that are interested in extract_loudest_section are comparing it to the libraries listed below
- Speech recognition software where the neural net is trained with TensorFlow and GMM training and decoding is done in Kaldi☆170Updated 8 years ago
- Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).☆130Updated 3 years ago
- Voice Activity Detection system (Matlab-based implementation)☆107Updated 7 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆109Updated 6 years ago
- Keras Interface for Kaldi ASR☆121Updated 7 years ago
- ☆25Updated 7 years ago
- ASR with PyTorch☆140Updated 5 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Updated 7 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆71Updated 5 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆83Updated 6 years ago
- Implementation of all-neural speech recognition systems using Keras and Tensorflow☆144Updated 7 years ago
- Web application to record speech for an open data set☆421Updated 4 years ago
- Deep neural network based speech enhancement toolkit☆212Updated 5 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 7 years ago
- Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure☆89Updated 6 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆375Updated last year
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 7 years ago
- Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.☆28Updated 7 years ago
- ☆130Updated 6 years ago
- Tutorial on Kaldi for Brandeis ASR course☆76Updated 5 years ago
- PyTorch implementations of neural network models for keyword spotting☆514Updated last year
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆309Updated 6 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆45Updated 4 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆137Updated 3 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Updated 7 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated last year
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆31Updated 5 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆122Updated 4 years ago