AKBoles / Deep-Learning-Speech-RecognitionView external linksLinks
Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.
☆50Feb 1, 2017Updated 9 years ago
Alternatives and similar repositories for Deep-Learning-Speech-Recognition
Users that are interested in Deep-Learning-Speech-Recognition are comparing it to the libraries listed below
Sorting:
- Unsupervised Speaker Clustering & Speaker Recognition☆13Jan 7, 2019Updated 7 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Oct 3, 2023Updated 2 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)☆12Aug 5, 2018Updated 7 years ago
- Speaker recognition/identification system in Python. Python3 port.☆14May 2, 2015Updated 10 years ago
- ☆65Dec 20, 2013Updated 12 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- Speaker diarization scripts, based on AaltoASR☆191Jan 3, 2019Updated 7 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- Phonetic and phonological vocoding platform☆17Nov 23, 2016Updated 9 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Sep 18, 2017Updated 8 years ago
- Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure☆88Feb 23, 2018Updated 7 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Character level speech recognizer using ctc loss with deep rnns in TensorFlow.☆78Jun 9, 2018Updated 7 years ago
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆16Aug 31, 2017Updated 8 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Jun 28, 2015Updated 10 years ago
- Speaker recognition library based on MARF for raspberry pi and other SBCs.☆57Jan 16, 2018Updated 8 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆32May 30, 2018Updated 7 years ago
- Time-domain Audio Separation Network☆24Aug 3, 2018Updated 7 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆21Nov 25, 2016Updated 9 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆122Jul 6, 2017Updated 8 years ago
- Automatic Speaker Recognition algorithms in Python☆96Sep 25, 2021Updated 4 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- Text-Dependent Speaker Recognition System with Machine Learning Techniques☆10Dec 31, 2017Updated 8 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Portal Tutorial☆11Feb 3, 2018Updated 8 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- A python implementation of the neural network joint language model and an extension of it using global source context.☆11May 17, 2017Updated 8 years ago
- Audio source separation using CASA approaches in Python.☆11Apr 2, 2015Updated 10 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Jan 8, 2021Updated 5 years ago
- Zero-Resource Speech Discovery, Search, and Evaluation Tools☆29Aug 6, 2015Updated 10 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- A collection of trending speech enhancement papers☆11Dec 4, 2020Updated 5 years ago