petewarden / extract_loudest_section
Trims .wav audio files to the loudest section of a given length
☆95Updated 7 years ago
Alternatives and similar repositories for extract_loudest_section:
Users that are interested in extract_loudest_section are comparing it to the libraries listed below
- Speech recognition software where the neural net is trained with TensorFlow and GMM training and decoding is done in Kaldi☆170Updated 8 years ago
- ☆25Updated 7 years ago
- Voice Activity Detection system (Matlab-based implementation)☆108Updated 7 years ago
- DCASE 2017 Baseline system☆82Updated 4 years ago
- HTK features in Python☆72Updated 6 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- Keras Interface for Kaldi ASR☆121Updated 7 years ago
- Fetch and use Google's AudioSet dataset☆125Updated 7 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Updated 7 years ago
- Audio Classifier in Keras using Convolutional Neural Network☆160Updated 5 years ago
- Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).☆130Updated 4 years ago
- Implementation of all-neural speech recognition systems using Keras and Tensorflow☆144Updated 7 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆109Updated 6 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.☆29Updated 7 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 7 years ago
- Simple Speech Keyword Detecting with Depthwise Separable Convolutions | DLology☆42Updated 6 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆83Updated 6 years ago
- Deep neural network based speech enhancement toolkit☆213Updated 5 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events☆128Updated 9 months ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆204Updated 3 years ago
- DCASE 2018 Baseline systems☆129Updated 5 years ago
- Voice Activity Detector☆73Updated 2 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- An opensource speech-to-text software written in tensorflow☆158Updated 2 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆376Updated 2 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 7 years ago
- Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure☆89Updated 7 years ago
- Tutorial on Kaldi for Brandeis ASR course☆76Updated 5 years ago