goepfert / audio_features
Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js
☆13 · Updated 4 years ago
Alternatives and similar repositories for audio_features
Users who are interested in audio_features are comparing it to the libraries listed below.
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ☆46 · Updated 2 years ago
- Web app for keyword spotting using TensorflowJS ☆74 · Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆237 · Updated last week
- A java wrapper around the WebRTC Voice Activity Detection library ☆66 · Updated 4 years ago
- ☆43 · Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆134 · Updated last year
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave ☆20 · Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆103 · Updated 5 years ago
- 🐍 Coqui's machine learning job scheduler ☆31 · Updated 4 years ago
- Extract frequency, power, width and dissonance of formants from wav files ☆27 · Updated 3 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆99 · Updated 3 years ago
- automatic spoken language identification ☆90 · Updated 7 years ago
- 🐸TTS recipes for different datasets ☆86 · Updated 3 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit… ☆171 · Updated 5 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow ☆129 · Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTC ☆69 · Updated 2 years ago
- Project repository for the work done in Triplet Entropy Loss: Improving The Generalization of Short Speech Language Identification Syst… ☆13 · Updated 4 years ago
- This repository is a collection of TTS Models in TFLite ☆201 · Updated 4 years ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech ☆341 · Updated 3 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation ☆173 · Updated 3 years ago
- 🐸STT integration examples ☆129 · Updated 3 years ago
- Forced Alignments for Common Voice ☆32 · Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆107 · Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Updated 2 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261] ☆27 · Updated 4 years ago
- Speaker diarization scripts, based on AaltoASR ☆191 · Updated 6 years ago
- Pytorch implementation of Deepmind's WaveRNN model ☆122 · Updated 6 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app ☆29 · Updated 2 years ago
- Simple text to phonemes converter for multiple languages ☆20 · Updated 3 years ago
- Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial ☆41 · Updated 5 years ago