goepfert / audio_features
Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with TensorFlow.js
☆13 Updated 3 years ago
Alternatives and similar repositories for audio_features
Users who are interested in audio_features are comparing it to the libraries listed below
- ☆43 Updated last year
- Lyrics-to-audio alignment system. Initially done using HTK for rapid prototyping ☆14 Updated 7 years ago
- Experiments to test different speech recognition systems for SEPIA Framework ☆60 Updated 2 years ago
- Tacotron text to speech in C++ (synthesis only) ☆76 Updated 5 years ago
- Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset ☆164 Updated 6 years ago
- A pipeline to isolate and transcribe one language in mixed-language speech ☆18 Updated 2 years ago
- Kaldi-based speaker verification ☆47 Updated 7 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆115 Updated 2 years ago
- Experiments and tutorials with and for torchaudio ☆13 Updated 4 years ago
- Evaluate results from ASR/Speech-to-Text quickly ☆37 Updated 3 years ago
- Multilingual Audio2Vec ☆8 Updated 7 years ago
- An application that helps people with dysarthria improve their pronunciation using deep learning ☆10 Updated 4 years ago
- Uses VITS and Opencpop to develop singing voice synthesis; different from VISinger. ☆35 Updated 2 years ago
- Extract frequency, power, width and dissonance of formants from wav files ☆26 Updated 3 years ago
- Integration of FastSpeech text-to-mel generation and the fast vocoder SqueezeWave ☆20 Updated last year
- Web app for keyword spotting using TensorflowJS ☆72 Updated 2 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm) ☆27 Updated 8 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems ☆270 Updated last year
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications. ☆16 Updated 4 years ago
- A very basic demonstration connecting speech recognition and text-to-speech ☆20 Updated 5 years ago
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- ☆82 Updated 6 years ago
- 🐸TTS recipes for different datasets ☆87 Updated 2 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow ☆128 Updated 4 years ago
- 🐸STT integration examples ☆129 Updated 2 years ago
- A TensorFlow implementation of the Griffin-Lim algorithm ☆79 Updated 7 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆102 Updated 2 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones ☆71 Updated 6 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆218 Updated last week