JohannesBuchner / spoken-command-recognitionLinks
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
☆69Updated 7 years ago
Alternatives and similar repositories for spoken-command-recognition
Users that are interested in spoken-command-recognition are comparing it to the libraries listed below
Sorting:
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- A Collection of Speech Corpus for ASR and TTS☆114Updated 7 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- A simple audio feature extraction library☆80Updated 5 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 7 years ago
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆46Updated 4 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Updated 7 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 8 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated last year
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 5 years ago
- Adapting your own Language Model for Kaldi☆63Updated 6 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 5 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 4 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 8 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago
- Speech Enhancement using Bayesian WaveNet☆96Updated 7 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 6 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 3 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams☆68Updated 6 years ago
- Long audio alignment using Kaldi☆23Updated 4 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones☆71Updated 6 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆81Updated 3 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 3 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 7 years ago