jimkang / word-phoneme-map
Provides a two-way map between the words and phonemes listed in the CMU Pronouncing Dictionary.
☆9Updated 9 years ago
Alternatives and similar repositories for word-phoneme-map:
Users that are interested in word-phoneme-map are comparing it to the libraries listed below
- Re-sample Audio☆39Updated 4 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆97Updated 2 years ago
- Tools for working with the CMU Pronunciation Dictionary☆35Updated 7 years ago
- A collection of modules and utilities for doing things with phonemes.☆50Updated 2 years ago
- Face Detection in the Browser with WebGL☆34Updated 7 years ago
- Alphabot: a screen-less interactive spelling primer powered by computer vision☆14Updated 6 years ago
- a port of the Wavenet algorithm to generate poems (using Samuel Graván's @Zeta36 code).☆36Updated 7 years ago
- Neural network-based speech transcription model. Built with Keras (Python) and trained with TIMIT.☆19Updated 8 years ago
- Using Kaldi (Automatic Speech Recognition) and Gentle (Forced Word Aligner), this script finds both rhymes and alliteration in speeches w…☆13Updated 6 years ago
- Using Tensorflow's im2txt model to generate stories in an iOS app.☆23Updated 7 years ago
- Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial☆41Updated 5 years ago
- Node.JS implementation of the MFCC algorithm☆40Updated 8 years ago
- Given a mixed song, remove components that you have☆22Updated 10 years ago
- Proof of concept app that demonstrates use of KeenASR SDK in ObjC. WE ARE HIRING: https://keenresearch.com/careers.html☆71Updated this week
- A Wavenet generative model in TensorFlow, trained with Western Classical solo piano canon with global and local conditioning☆11Updated 7 years ago
- A script for audio/transcript alignment. Fork of p2fa.☆68Updated 6 years ago
- Speech recognition in JavaScript☆18Updated 6 years ago
- MAGE is a C/C++ software toolkit for reactive implementation of HMM-based speech and singing synthesis.☆61Updated 10 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆86Updated 4 years ago
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionary☆75Updated 3 years ago
- Mirror of GlottHMM☆10Updated 8 years ago
- A smoothly animated spectrogram display in WebGL (FFT in Python/Tornado)☆42Updated 5 years ago
- option for recording videos server side using opus packets☆37Updated 8 years ago
- Dynamic time warping for JavaScript☆51Updated 8 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17Updated 9 years ago
- ☆66Updated 8 years ago
- MSc AI Project on generative deep networks and neural style transfer for audio☆63Updated 7 years ago
- Text-based media editing interface☆16Updated 7 years ago
- Variable speed pitch shifter written in JavaScript☆79Updated 11 years ago
- A tutorial diphone synthesizer in Python☆23Updated 6 years ago