goepfert / audio_features
Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js
☆13 Updated 3 years ago
Alternatives and similar repositories for audio_features
Users who are interested in audio_features are comparing it to the libraries listed below.
- Web app for keyword spotting using TensorflowJS ☆71 Updated 2 years ago
- Streaming Audio Models Examples in JS ☆17 Updated last year
- Evaluate results from ASR/Speech-to-Text quickly ☆37 Updated 3 years ago
- Tacotron text to speech in C++ (synthesize only) ☆76 Updated 5 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ☆47 Updated 2 years ago
- Mellotron singing synthesizer using CPU ☆13 Updated 2 years ago
- A pipeline to isolate and transcribe one language in mixed-language speech ☆18 Updated 2 years ago
- Coqui Inference Engine ☆40 Updated 3 years ago
- Kaldi based speaker verification ☆47 Updated 7 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave ☆20 Updated last year
- Extract frequency, power, width and dissonance of formants from wav files ☆26 Updated 3 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger. ☆35 Updated 2 years ago
- Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a we… ☆30 Updated 6 months ago
- Forced Alignments for Common Voice ☆31 Updated 4 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation. ☆29 Updated 11 months ago
- ☆22 Updated 3 years ago
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- ☆40 Updated last year
- 🐸TTS recipes for different datasets ☆87 Updated 2 years ago
- Official home of the Idlak Speech Synthesis Toolkit ☆66 Updated 3 years ago
- An application that uses deep learning to help people with dysarthria improve their pronunciation ☆10 Updated 4 years ago
- An online speech recognition extension toolkit of Kaldi ☆56 Updated 3 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++ ☆17 Updated last year
- ☆54 Updated last year
- Keyword spotting by Kaldi library ☆26 Updated 8 years ago
- Speaker diarization python system based on binary key speaker modelling ☆61 Updated 3 years ago
- ☆24 Updated 5 years ago
- Tools for working with the CMU Pronunciation Dictionary ☆35 Updated 7 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106 ☆46 Updated 4 years ago