goepfert / audio_featuresLinks
Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js
☆13Updated 4 years ago
Alternatives and similar repositories for audio_features
Users that are interested in audio_features are comparing it to the libraries listed below
Sorting:
- Web app for keyword spotting using TensorflowJS☆74Updated 2 years ago
 - Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a we…☆36Updated 11 months ago
 - On-device voice activity detection (VAD) powered by deep learning☆232Updated last month
 - Python server for communicating with Kaldi from the browser using WebRTC☆69Updated 2 years ago
 - 🐸TTS recipes for different datasets☆86Updated 3 years ago
 - Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
 - SEPIA server to support open-source speech recognition via WebSocket connection.☆132Updated 11 months ago
 - A java wrapper around the WebRTC Voice Activity Detection library☆66Updated 4 years ago
 - Kaldi based speaker verification☆47Updated 7 years ago
 - Extract frequency, power, width and dissonance of formants from wav files☆27Updated 3 years ago
 - ☆43Updated last year
 - 🐍 Coqui's machine learning job scheduler☆31Updated 4 years ago
 - 🐸STT integration examples☆129Updated 3 years ago
 - A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆217Updated 5 years ago
 - A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆32Updated last year
 - speaker diarization system using an LSTM☆50Updated 2 years ago
 - flask+tornado based NVIDIA tacotron2+waveglow tts web app☆29Updated 2 years ago
 - Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
 - A Convolutional Neural Network based Voice Activity Detector for Smartphones☆70Updated 6 years ago
 - Classify audio with neural nets on embedded systems like the Raspberry Pi☆87Updated last year
 - DeepSpeech based forced alignment tool☆239Updated 4 years ago
 - Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
 - This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 6 years ago
 - How to create your own model for vosk☆75Updated 4 years ago
 - A pipeline to isolate and transcribe one language in mixed-language speech☆19Updated 3 years ago
 - This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
 - A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
 - VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆260Updated last year
 - python wrapper for rnnoise library☆48Updated 2 years ago
 - Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆27Updated 4 years ago