crawles/dtw

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/crawles/dtw)

crawles / dtw

Simple speech recognition using dynamic time warping with examples

☆29

Alternatives and similar repositories for dtw

Users that are interested in dtw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mrouvier / plda
View on GitHub
Probabilistic Linear Discriminant Analysis
☆14Nov 14, 2014Updated 11 years ago
lukeinator42 / transfer_learning_sound_classification
View on GitHub
☆17Jul 17, 2017Updated 9 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
aishoot / DTWSpeech
View on GitHub
A simple application of DTW Algorithm in isolate word speech recognition.
☆17Mar 9, 2020Updated 6 years ago
ejhumphrey / dl4mir-dissertation
View on GitHub
Humphrey, E. J. "An Exploration of Deep Learning in Music Informatics." (2015) New York University.
☆14Feb 23, 2016Updated 10 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
qqueing / SR_with_kaldi
View on GitHub
Speaker embedding(verification and recognition) using Tensorflow with Kaldi
☆41Sep 18, 2017Updated 8 years ago
mozilla / murmur
View on GitHub
DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training
☆20May 23, 2019Updated 7 years ago
kamperh / recipe_swbd_wordembeds
View on GitHub
☆22Mar 22, 2017Updated 9 years ago
torogmw / MusicSegmentation
View on GitHub
a music segmentation algorithm that I proposed and implemented as my undergraduate project. The basic function is: a song is loaded to th…
☆16Apr 19, 2013Updated 13 years ago
aihpi / pilotproject-leichte-sprache
View on GitHub
Simplify German language! Leichte Sprache Tool.
☆12May 19, 2026Updated 2 months ago
rakshithShetty / dnn-speech
View on GitHub
This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition
☆12Dec 8, 2015Updated 10 years ago
shane-settle / neural-acoustic-word-embeddings
View on GitHub
☆45Apr 5, 2019Updated 7 years ago
david-ryan-snyder / kaldi
View on GitHub
This is now the official location of the Kaldi project.
☆10Aug 22, 2019Updated 6 years ago
mlml / autovot
View on GitHub
Trainable algorithm for automatic measurement of voice onset time
☆69Jul 26, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
BiometricVox / DAE_SpeakerID
View on GitHub
Denoising autoencoders for speaker identification on MCE 2018 challenge
☆12Nov 8, 2018Updated 7 years ago
sdrobert / pydrobert-kaldi
View on GitHub
SWIG bindings for Kaldi I/O, built with Conda
☆15Dec 15, 2024Updated last year
udibr / LRE
View on GitHub
NIST Language i-vector Machine Learning Challenge
☆27Sep 15, 2016Updated 9 years ago
meelement / noise_adversarial_tacotron
View on GitHub
Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…
☆17Aug 15, 2019Updated 6 years ago
dzhelonkin / kaldi_kws
View on GitHub
Keyword spotting by Kaldi library
☆26Oct 26, 2016Updated 9 years ago
kamperh / speech_dtw
View on GitHub
Dynamic time warping (DTW) functions for specifically speech alignment.
☆30May 6, 2024Updated 2 years ago
CSLT-THU / IS2019-VAE
View on GitHub
Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Mar 24, 2023Updated 3 years ago
jcsilva / deep-clustering
View on GitHub
☆70Feb 16, 2017Updated 9 years ago
PeterGilles / Luxembourgish-language-resources
View on GitHub
language resources for Luxembourgish
☆14Jul 20, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tachi-hi / tts_samples
View on GitHub
Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…
☆15May 30, 2021Updated 5 years ago
mravanelli / theano-kaldi-rnn
View on GitHub
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is c…
☆34Apr 15, 2018Updated 8 years ago
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
yakouyang / VAD
View on GitHub
voice active detection (python ver/simple and easy-to-use)
☆12May 1, 2017Updated 9 years ago
AI-Guru / SincNet
View on GitHub
Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)
☆12Aug 5, 2018Updated 7 years ago
IPS-LMU / soundChangeR
View on GitHub
soundChangeR: an agent-based model for simulating sound change
☆15Sep 3, 2025Updated 10 months ago
bmcfee / laplacian_segmentation
View on GitHub
graph laplacian song segmentation
☆18Apr 5, 2016Updated 10 years ago
hanshounsu / d3rm
View on GitHub
☆14Feb 3, 2026Updated 5 months ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
idiap / phonvoc
View on GitHub
Phonetic and phonological vocoding platform
☆17Nov 23, 2016Updated 9 years ago
CaydenPierce / MSA
View on GitHub
Open Source Wearable Microphone Array Glasses for Multi-Speaker Speech Recognition
☆18May 12, 2022Updated 4 years ago
irebai / SpecAugment_KALDI
View on GitHub
A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆15Sep 4, 2019Updated 6 years ago
danFromTelAviv / key_words_spotting
View on GitHub
implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"
☆38Dec 8, 2019Updated 6 years ago
galv / voice-conversion
View on GitHub
torch7 module to convert one person's voice to another's.
☆16Jan 9, 2016Updated 10 years ago
facebookresearch / codraw-models
View on GitHub
Models for the Collaborative Drawing (CoDraw) task
☆14Jan 15, 2019Updated 7 years ago
liyongze / lstm_speaker_verification
View on GitHub
☆35Apr 8, 2019Updated 7 years ago