goepfert / audio_features
Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js
☆13Updated 4 years ago
Alternatives and similar repositories for audio_features
Users who are interested in audio_features are comparing it to the repositories listed below.
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆46Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆242Updated 2 weeks ago
- Web app for keyword spotting using TensorflowJS☆74Updated 3 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated 2 years ago
- A java wrapper around the WebRTC Voice Activity Detection library☆66Updated 4 years ago
- ☆44Updated last year
- automatic spoken language identification☆90Updated 7 years ago
- Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a we…☆37Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆135Updated last year
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- speaker diarization system using an LSTM☆50Updated 3 years ago
- This repository is a collection of TTS Models in TFLite☆201Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 3 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Updated 3 years ago
- Desktop application for neural speech synthesis written in C++☆212Updated this week
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated 2 years ago
- Speaker diarization scripts, based on AaltoASR☆191Updated 7 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆265Updated last year
- Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recogni…☆18Updated 5 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated 2 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Updated 4 years ago
- Tools for speech processing, keyword spotting☆17Updated 5 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆34Updated 7 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech☆341Updated 3 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆28Updated 3 years ago
- Project repository for the work done in Triplet Entropy Loss: Improving The Generalization of Short Speech Language Identification Syst…☆13Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆106Updated 2 years ago
- Speaker diarization python system based on binary key speaker modelling☆60Updated 4 years ago