A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).
☆17Apr 23, 2018Updated 8 years ago
Alternatives and similar repositories for ASR
Users that are interested in ASR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using MFCC feature and DTW algorithm to recognize rumber 0-9☆19Nov 20, 2017Updated 8 years ago
- 基于DTW与MFCC特征进行数字0-9的语音识别,DTW,MFCC,语音识别,中英数据,端点检测,Digital Voice Recognition。☆43Jul 29, 2021Updated 4 years ago
- VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The…☆10Apr 17, 2020Updated 6 years ago
- Construct GMM-HMM and Implement the Viterbi algorithm for continuous speech recognition☆15Apr 1, 2018Updated 8 years ago
- Counts frequencies of words using movie and television subtitles.☆20Jan 26, 2015Updated 11 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFC…☆18Jun 27, 2017Updated 8 years ago
- Acoustic and language models for minorised languages.☆26Sep 30, 2020Updated 5 years ago
- Code for AccentDB.☆23May 28, 2021Updated 4 years ago
- A repository for dictionaries to be used with the Prosodylab-Aligner☆17May 13, 2014Updated 11 years ago
- Voice Activity Detection LSTM-RNN learning model☆50Apr 17, 2018Updated 8 years ago
- Tools for working with the CMU Pronunciation Dictionary☆36Sep 5, 2017Updated 8 years ago
- NLPIR tutorial: pretrain for IR. pre-train on raw textual corpus, fine-tune on MS MARCO Document Ranking☆13Sep 10, 2021Updated 4 years ago
- Deep Learning model for lexical stress detection in spoken English☆28Mar 17, 2020Updated 6 years ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…☆26Jun 25, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12May 6, 2020Updated 5 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- A repo for Kaggle Competitions☆11Mar 21, 2018Updated 8 years ago
- Classify audio samples using a neural network☆10May 19, 2017Updated 8 years ago
- Implementation of Relation Extraction with Multi-instance Multi-label Convolutional Neural Networks in tensorflow☆15Apr 2, 2017Updated 9 years ago
- ☆10Nov 1, 2025Updated 6 months ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Mar 14, 2018Updated 8 years ago
- A python wrapper for kaldi-online-decoder using Cython☆12Sep 1, 2017Updated 8 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Speaker Recognition application using fast-forward NN☆16Jun 14, 2012Updated 13 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- Text-Dependent Speaker Recognition System with Machine Learning Techniques☆10Dec 31, 2017Updated 8 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 9 years ago
- The PT tracing portion of Barnum.☆11Feb 8, 2019Updated 7 years ago
- Scripts to convert audio files to spectrograms and back☆12Nov 23, 2017Updated 8 years ago
- 语音切割,python ,webrtc☆11Sep 28, 2018Updated 7 years ago
- LLVM-based compiler to create artificial software diversity to protect software from code-reuse attacks.☆18Sep 12, 2018Updated 7 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- This project is aim to develope a 2D "chicken dinner"☆15Aug 18, 2018Updated 7 years ago
- Implementation trade-offs in using Intel Pin for instruction tracing of complex programs☆15Oct 16, 2019Updated 6 years ago
- Python C extension for the eSpeak speech synthesizer☆12Jan 23, 2021Updated 5 years ago
- Quart is a Python asyncio web microframework with the same API as Flask.☆12May 7, 2018Updated 7 years ago
- Normalize text string☆12Nov 6, 2018Updated 7 years ago
- ☆13Apr 4, 2024Updated 2 years ago