☆10Sep 19, 2018Updated 7 years ago
Alternatives and similar repositories for end-point-detection
Users that are interested in end-point-detection are comparing it to the libraries listed below
Sorting:
- Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository☆26Jul 26, 2020Updated 5 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- a optional way to extract audio feature☆13Jun 10, 2017Updated 8 years ago
- Easier analysis of large speech corpora☆23Jun 22, 2021Updated 4 years ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Aug 12, 2024Updated last year
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Sep 4, 2019Updated 6 years ago
- Construct GMM-HMM and Implement the Viterbi algorithm for continuous speech recognition☆15Apr 1, 2018Updated 7 years ago
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆17Aug 31, 2017Updated 8 years ago
- System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models☆21Nov 5, 2017Updated 8 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- "Recurrent Models of Visual Attention" in TensorFlow☆41Apr 13, 2017Updated 8 years ago
- This repository☆30Nov 13, 2022Updated 3 years ago
- Speech Recognition implementation using Artificial Neural Networks☆10Sep 7, 2015Updated 10 years ago
- Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"☆32Sep 3, 2018Updated 7 years ago
- Deep neural network based speech enhancement toolkit☆219Jun 14, 2019Updated 6 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Sep 26, 2018Updated 7 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Mar 4, 2021Updated 5 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago
- Faster Deep Neural Networks☆37Sep 12, 2017Updated 8 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆36Aug 15, 2019Updated 6 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- ☆15Mar 15, 2022Updated 3 years ago
- Using acceleration and heart rate data to classify awake, deep, and light sleep☆10Dec 21, 2017Updated 8 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- Simple implementation of TDOA localization algorithm.☆13Oct 12, 2016Updated 9 years ago
- A signal processing library, currently sufficient for basic speech recognition stuff like mel frequency cepstrum☆19Mar 15, 2012Updated 13 years ago
- ☆11May 4, 2020Updated 5 years ago
- how to generate the full-contextual labels from un-seen text for the application of HMM-based speech synthesis (HTS)☆12Nov 22, 2019Updated 6 years ago
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- Python based tool to use text to speech to read books or study material quickly.☆10Sep 22, 2021Updated 4 years ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- 夏目悠李/男声歌声データベースの最新ラベルデータ☆11Sep 2, 2020Updated 5 years ago
- Semantic dependency relationship extractor untuk bahasa Indonesia... termasuk bahasa gaul dan alay ;) (terinspirasi oleh OpenCog RelEx)☆10Oct 2, 2015Updated 10 years ago
- Implementation of joint bayesian model, written in python.☆11Aug 2, 2021Updated 4 years ago
- Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'☆12Mar 22, 2023Updated 2 years ago
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15☆12Apr 17, 2017Updated 8 years ago
- A framework for building speech-enabled websites.☆10Jul 10, 2015Updated 10 years ago