chunan / libdtw
An implementation of dynamic time warping (DTW) for spoken term detection, including unconstrained, segmental, and slope-constrained versions. For more detail, see https://drive.google.com/file/d/1TqvicnaZABsRSHIh3PjJWVRic495qcEP/view?usp=sharing.
☆15 Updated 5 years ago
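As a rough illustration of the unconstrained variant named in the description, here is a minimal Python sketch of the textbook DTW recurrence. It is not taken from libdtw and does not reflect its API: the function name `dtw_distance`, the Euclidean frame distance, and the three-way step pattern are all assumptions made for the example. The segmental and slope-constrained versions would restrict which cells or steps are allowed and are not shown.

```python
import math


def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of feature vectors.

    Hypothetical helper for illustration only (not libdtw's interface).
    `a` and `b` are non-empty sequences of equal-dimension numeric tuples.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] holds the minimal accumulated cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Local distance between frames (assumed Euclidean).
            d = math.dist(a[i - 1], b[j - 1])
            # Unconstrained step pattern: vertical, horizontal, or diagonal move.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]
```

For instance, `dtw_distance([(0.0,), (1.0,), (2.0,)], [(0.0,), (1.0,), (1.0,), (2.0,)])` returns `0.0`, because the repeated frame in the second sequence is absorbed by the warping path.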
Related projects
Alternatives and complementary repositories for libdtw
- Neural Turing machine for source separation in TensorFlow ☆18 Updated 7 years ago
- Voice Activity Detection ☆42 Updated 7 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise. ☆72 Updated 7 years ago
- Singing-Voice Separation From Monaural Recordings Using Robust Principal Component Analysis ☆66 Updated 3 years ago
- DenseNets for the detection of singing birds in audio files ☆17 Updated 7 years ago
- Multiple Fundamental Frequency Estimation ☆26 Updated 10 years ago
- A Python library for different types of vocoders like LPC, MCEP, PSOLA, etc. ☆35 Updated 9 years ago
- DCASE 2016 Baseline system, Python implementation ☆51 Updated 7 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification ☆10 Updated 6 years ago
- This is a project on working/resolving the speech separation problem using supervised learning on various training targets, building mach… ☆34 Updated 7 years ago
- A PyTorch implementation of FFTNet. ☆36 Updated 6 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274 ☆60 Updated 5 years ago
- Pulse Model vocoder ☆41 Updated 5 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py… ☆33 Updated 6 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16 ☆22 Updated 2 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning. ☆23 Updated 5 years ago
- Kaldi Speech Processing Tools ☆24 Updated 6 years ago
- ☆26 Updated 7 years ago
- Attacking Speaker Recognition with Deep Generative Models ☆34 Updated last year
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications. ☆78 Updated 5 years ago
- Speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN ☆34 Updated 6 years ago
- MATLAB real-time/interactive speech tools. This series is obsolete. SP3ARK is the up-to-date series (will be). ☆55 Updated 3 years ago
- Bag-of-Features Acoustic Event Detection ☆14 Updated 8 years ago
- ☆22 Updated 7 years ago
- Some notes on Kaldi ☆31 Updated 9 years ago
- Extended speech recognition neural network based on Kaldi for reproducible research ☆15 Updated 9 years ago
- Overlapped Speech detection in Multi-party Conversations ☆18 Updated 6 years ago
- Python functions to convert between different speech quality metrics ☆54 Updated 6 years ago