goepfert / audio_featuresLinks
Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js
☆13Updated 4 years ago
Alternatives and similar repositories for audio_features
Users that are interested in audio_features are comparing it to the libraries listed below
Sorting:
- Web app for keyword spotting using TensorflowJS☆74Updated 2 years ago
- Speech recognition system implemented using tensorflow☆16Updated 2 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a we…☆36Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆233Updated last week
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated 2 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- ☆43Updated last year
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.☆16Updated 5 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- Fine-tune WhisperAI model to your language☆21Updated 2 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 4 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆27Updated 4 years ago
- A pipeline to isolate and transcribe one language in mixed-language speech☆19Updated 3 years ago
- A java wrapper around the WebRTC Voice Activity Detection library☆66Updated 4 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆33Updated 7 years ago
- 🐸STT integration examples☆129Updated 3 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Updated 8 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones☆70Updated 6 years ago
- This repository is a collection of TTS Models in TFLite☆201Updated 4 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.☆54Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated 2 years ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Updated 6 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆27Updated 3 years ago
- Tensorflow Implementation of Expressive Tacotron☆196Updated 7 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆32Updated last year
- Keyword spotting by Kaldi library☆26Updated 9 years ago