A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).
☆17 · Apr 23, 2018 · Updated 8 years ago
Alternatives and similar repositories for ASR
Users who are interested in ASR are comparing it to the libraries listed below.
- Using MFCC features and the DTW algorithm to recognize the numbers 0-9 ☆19 · Nov 20, 2017 · Updated 8 years ago
- ☆10 · May 22, 2023 · Updated 3 years ago
- Speech recognition of the digits 0-9 based on DTW and MFCC features: DTW, MFCC, speech recognition, Chinese and English data, endpoint detection, Digital Voice Recognition. ☆42 · Jul 29, 2021 · Updated 4 years ago
- VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The… ☆10 · Apr 17, 2020 · Updated 6 years ago
- ☆22 · Jul 28, 2018 · Updated 7 years ago
- Acoustic and language models for minorised languages. ☆26 · Sep 30, 2020 · Updated 5 years ago
- Code for AccentDB. ☆24 · May 28, 2021 · Updated 5 years ago
- Voice Activity Detection LSTM-RNN learning model ☆50 · Apr 17, 2018 · Updated 8 years ago
- Tools for working with the CMU Pronunciation Dictionary ☆36 · Sep 5, 2017 · Updated 8 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps… ☆28 · Mar 8, 2020 · Updated 6 years ago
- NLPIR tutorial: pre-training for IR. Pre-train on a raw textual corpus, fine-tune on MS MARCO Document Ranking ☆13 · Sep 10, 2021 · Updated 4 years ago
- Deep Learning model for lexical stress detection in spoken English ☆28 · Mar 17, 2020 · Updated 6 years ago
- Perform forced decoding with a target transcription ☆11 · Sep 12, 2018 · Updated 7 years ago
- A repo for Kaggle Competitions ☆11 · Mar 21, 2018 · Updated 8 years ago
- Classify audio samples using a neural network ☆10 · May 19, 2017 · Updated 9 years ago
- Implementation of Relation Extraction with Multi-instance Multi-label Convolutional Neural Networks in tensorflow ☆15 · Apr 2, 2017 · Updated 9 years ago
- STT Service based on Kaldi ASR ☆15 · Aug 17, 2018 · Updated 7 years ago
- ☆10 · Nov 1, 2025 · Updated 8 months ago
- DAPP for Nebulas ☆21 · May 15, 2018 · Updated 8 years ago
- ☆13 · Oct 27, 2021 · Updated 4 years ago
- Python Audio Search Engine: search for audio .wav files based on percent similarity ☆14 · May 12, 2014 · Updated 12 years ago
- Baseline for the pre-trained model knowledge measurement competition (F1 0.35, BERTForMaskedLM) ☆13 · Sep 2, 2021 · Updated 4 years ago
- Speaker Recognition application using fast-forward NN ☆16 · Jun 14, 2012 · Updated 14 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 · Jan 17, 2024 · Updated 2 years ago
- Text-Dependent Speaker Recognition System with Machine Learning Techniques ☆10 · Dec 31, 2017 · Updated 8 years ago
- Scripts to convert audio files to spectrograms and back ☆12 · Nov 23, 2017 · Updated 8 years ago
- Speech segmentation, Python, WebRTC ☆11 · Sep 28, 2018 · Updated 7 years ago
- LLVM-based compiler to create artificial software diversity to protect software from code-reuse attacks. ☆18 · Sep 12, 2018 · Updated 7 years ago
- A CNN audio classifier via spectrogram images. ☆10 · Jul 21, 2017 · Updated 8 years ago
- NMT based punctuation prediction system using lexical and acoustic features. ☆14 · Mar 30, 2020 · Updated 6 years ago
- Python wrapper for hts engine ☆14 · Jan 30, 2018 · Updated 8 years ago
- Implementation trade-offs in using Intel Pin for instruction tracing of complex programs ☆15 · Oct 16, 2019 · Updated 6 years ago
- Quart is a Python asyncio web microframework with the same API as Flask. ☆12 · May 7, 2018 · Updated 8 years ago
- Python C extension for the eSpeak speech synthesizer ☆12 · Jan 23, 2021 · Updated 5 years ago
- ☆13 · Apr 4, 2024 · Updated 2 years ago
- Normalize text string ☆12 · Nov 6, 2018 · Updated 7 years ago
- Chinese Natural Language Correction via Language Model ☆15 · Sep 14, 2017 · Updated 8 years ago
- Python Japanese codecs by NKF (Network Kanji Filter) ☆19 · Mar 30, 2026 · Updated 3 months ago
- ☆14 · Dec 10, 2021 · Updated 4 years ago