Using MFCC feature and DTW algorithm to recognize rumber 0-9
☆19Nov 20, 2017Updated 8 years ago
Alternatives and similar repositories for DTW-Speech-Recognition
Users that are interested in DTW-Speech-Recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech …☆17Apr 23, 2018Updated 8 years ago
- speech recognition of digits based on single Gaussian, Gaussian Mixture, and Hidden Markov Models☆11Jun 3, 2020Updated 5 years ago
- 基于GMM与MFCC特征进行数字0-9的语音识别,GMM,MFCC,语音识别,中文数据,sklearn,Digital Voice Recognition。☆18Jun 21, 2022Updated 3 years ago
- 基于DTW与MFCC特征进行数字0-9的语音识别,DTW,MFCC,语音识别,中英数据,端点检测,Digital Voice Recognition。☆43Jul 29, 2021Updated 4 years ago
- 基于HMM与MFCC特征进行数字0-9的语音识别,HMM,GMMHMM,MFCC,语音识别,sklearn,Digital Voice Recognition。☆17Jun 22, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆10May 22, 2023Updated 3 years ago
- 一个试图通过语音及识别后的文字捕捉心情感受的小程序☆10May 2, 2019Updated 7 years ago
- Acoustic event detection using recurrent neural networks.☆11Sep 4, 2018Updated 7 years ago
- VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The…☆10Apr 17, 2020Updated 6 years ago
- Mispronunciation detection code for jingju singing voice☆19Sep 5, 2018Updated 7 years ago
- Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFC…☆18Jun 27, 2017Updated 8 years ago
- ☆22Jul 28, 2018Updated 7 years ago
- 这是一个提供文章朗读的Chrome插件,解放双眼,聆听世界。☆26Mar 8, 2021Updated 5 years ago
- 给定一张身份证正、反面,识别身份证上的所有文字信息☆10Sep 4, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Acoustic and language models for minorised languages.☆26Sep 30, 2020Updated 5 years ago
- 同名论文消歧的工程化方案(参考2019智源-aminer人名消歧竞赛第一名方案)☆26Dec 8, 2022Updated 3 years ago
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- Overlapped Speech detection in Multi-party Conversations☆22Feb 20, 2018Updated 8 years ago
- PyTorch Implementation of CLEAN-Contact: Contrastive Learning-enabled Enzyme Functional Annotation Prediction with Structural Inference☆11May 29, 2024Updated 2 years ago
- Efficient voice activity detection algorithm using long-term speech information☆46Jan 9, 2018Updated 8 years ago
- Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019☆13Jan 6, 2020Updated 6 years ago
- A deep learning-based predictor of enzyme optimal pH☆10Jun 25, 2025Updated 11 months ago
- ☆11May 31, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- STURM-Flood dataset repository☆22Apr 12, 2025Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Nov 15, 2025Updated 6 months ago
- 🎉 Repo For My Undergrad Final Year Project "Course Q&A System Based on Fusion of Large Language Models(LLMs) With Knowledge Graphs"☆12Mar 7, 2024Updated 2 years ago
- ☆30Apr 23, 2026Updated last month
- RBM+BP神经网络识别手写数字和英文字符☆11Mar 25, 2023Updated 3 years ago
- 端到端的中文场景文字识别。☆12Jun 27, 2022Updated 3 years ago
- This repo is a fork of the ps-genai-agents repository. It contains the code demonstrated in the Function Calling in Agentic Workflows Med…☆18Mar 4, 2025Updated last year
- ☆16May 31, 2024Updated last year
- Deep Learning model for lexical stress detection in spoken English☆28Mar 17, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- PocketSphinx_Speech_Recognition☆10Aug 5, 2021Updated 4 years ago
- Variational Autoencoder (VAE)-based molecular SMILES string generator☆15Apr 23, 2025Updated last year
- ChemDataExtractor 2.1 that has been modified for extracting properties of molecular thermally-activated delayed fluorescent (TADF) materi…☆13May 23, 2025Updated last year
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…☆26Jun 25, 2019Updated 6 years ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.☆15Nov 5, 2022Updated 3 years ago
- Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for…☆15Jul 27, 2023Updated 2 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Jul 4, 2018Updated 7 years ago