goepfert / audio_features
Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js
☆13Updated 3 years ago
Alternatives and similar repositories for audio_features
Users that are interested in audio_features are comparing it to the libraries listed below
Sorting:
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Web app for keyword spotting using TensorflowJS☆71Updated 2 years ago
- Tacotron text to speech in C++(synthesize only)☆76Updated 5 years ago
- Evaluate results from ASR/Speech-to-Text quickly☆37Updated 3 years ago
- A java wrapper around the WebRTC Voice Activity Detection library☆61Updated 3 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆126Updated 6 months ago
- Kaldi based speaker verification☆47Updated 7 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Updated 8 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated last year
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- ☆33Updated 3 years ago
- Using OpenVINO to speed up MeloTTS inference☆11Updated 6 months ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- ☆43Updated 11 months ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 5 years ago
- Streaming Audio Models Examples in JS☆17Updated last year
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 6 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated last year
- VoxLingua107 recipe for SpeechBrain☆13Updated 3 years ago
- STT Service based on Kaldi ASR☆15Updated 6 years ago
- Python Wrapper of Silero VAD☆53Updated last week
- Extract frequency, power, width and dissonance of formants from wav files☆26Updated 2 years ago
- A demo of android key word spoting based on tensorflow tutial example☆28Updated 5 years ago
- ☆39Updated last year
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Updated 7 years ago
- 这是一个基于kaldi的iOS语音识别demo☆28Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago