goepfert / audio_featuresLinks
Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js
☆13Updated 3 years ago
Alternatives and similar repositories for audio_features
Users that are interested in audio_features are comparing it to the libraries listed below
Sorting:
- Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a we…☆32Updated 8 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆226Updated this week
- Web app for keyword spotting using TensorflowJS☆73Updated 2 years ago
- automatic spoken language identification☆90Updated 6 years ago
- 🐸STT integration examples☆129Updated 2 years ago
- ☆43Updated last year
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- Evaluate results from ASR/Speech-to-Text quickly☆38Updated 3 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆27Updated 4 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 9 months ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- Speaker diarization scripts, based on AaltoASR☆189Updated 6 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆98Updated 3 years ago
- A java wrapper around the WebRTC Voice Activity Detection library☆62Updated 4 years ago
- This is application for dysarthria to improve their pronunciation by using deep learning☆10Updated 4 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆172Updated 2 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆254Updated last year
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 5 years ago
- Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆36Updated 3 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated 2 years ago
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆150Updated last year
- Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recogni…☆17Updated 4 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆26Updated 3 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Updated 7 years ago
- DeepSpeech based forced alignment tool☆238Updated 4 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆29Updated 2 years ago