dtjchen / spoken-command-processorLinks
Neural network-based speech transcription model. Built with Keras (Python) and trained with TIMIT.
☆19Updated 9 years ago
Alternatives and similar repositories for spoken-command-processor
Users that are interested in spoken-command-processor are comparing it to the libraries listed below
Sorting:
- singing voice analysis and detection tools☆21Updated 10 years ago
- Audio Analysis by Conceptor☆30Updated 10 years ago
- Music structure segmentation with convnets☆13Updated 9 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆99Updated 3 years ago
- Mirror of GlottHMM☆10Updated 9 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Updated 7 years ago
- A script for audio/transcript alignment. Fork of p2fa.☆69Updated 7 years ago
- This is now the official location of the Kaldi project.☆13Updated 6 years ago
- Phonetic and phonological vocoding platform☆17Updated 9 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Updated 6 years ago
- Extended speech recognition neural network based on Kaldi for reproducible research☆15Updated 10 years ago
- Ossian: A simple language-independent Text-to-speech frontend☆17Updated 7 years ago
- Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information☆28Updated 8 years ago
- Support material and source code for the model described in : "A Recurrent Encoder-Decoder Approach With Skip-Filtering Connections For M…☆13Updated 8 years ago
- C++ Implementation of the Information Bottleneck System☆22Updated 7 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 8 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 7 years ago
- Perform the forced decoding with target transcription☆11Updated 7 years ago
- Python implementation of the Flexible Audio Source Separation Toolbox (FASST)☆91Updated 8 years ago
- ☆20Updated 7 years ago
- This module aims to extract emotions from audio. The input argument is either an uploaded audio/video file to the server or a URL. The o…☆22Updated 7 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 6 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17Updated 10 years ago
- Util code, issues, discussions☆29Updated 7 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Updated 9 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 6 years ago
- An app that graphs and compares the pitch contours of spoken language, to help language learners perfect their intonation (Hackbright Spr…☆30Updated 8 years ago
- python wrap for hts engine☆14Updated 8 years ago
- a music segmentation algorithm that I proposed and implemented as my undergraduate project. The basic function is: a song is loaded to th…☆16Updated 12 years ago
- SWIG bindings for Kaldi I/O, built with Conda☆15Updated last year