goepfert / audio_features
Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js
☆13 Updated 3 years ago
Related projects
Alternatives and complementary repositories for audio_features
- Evaluate results from ASR/Speech-to-Text quickly ☆36 Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆179 Updated last week
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated last year
- Tacotron text-to-speech in C++ (synthesis only) ☆75 Updated 5 years ago
- Kaldi based speaker verification ☆47 Updated 6 years ago
- ☆28 Updated 3 years ago
- A system for singing voice synthesis ☆79 Updated last year
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation. ☆27 Updated 5 months ago
- Web app for keyword spotting using TensorflowJS ☆69 Updated last year
- Voice Activity Detection (VAD) using deep learning. ☆192 Updated 5 years ago
- ☆34 Updated 10 months ago
- ☆50 Updated last year
- Lyrics-to-audio alignment system. Initially done using HTK for rapid prototyping ☆14 Updated 6 years ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project. ☆59 Updated 2 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave ☆20 Updated last year
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models" ☆11 Updated 5 years ago
- This repository contains code for a tutorial on end to end automatic speech recognition. ☆16 Updated 5 years ago
- An online speech recognition extension toolkit of Kaldi ☆57 Updated 3 years ago
- Flask + Tornado based NVIDIA Tacotron2 + WaveGlow TTS web app ☆28 Updated last year
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆97 Updated 2 years ago
- Jupyter Notebooks for creating Speech datasets ☆46 Updated 5 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram ☆225 Updated 3 months ago
- Charsiu: A neural phonetic aligner. ☆278 Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆27 Updated last year
- Python server for communicating with Kaldi from the browser using WebRTC ☆67 Updated last year
- Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner ☆42 Updated 3 years ago
- Demo and samples for universal speech translator ☆22 Updated 2 years ago
- Speaker diarization system in Python based on binary key speaker modelling ☆61 Updated 2 years ago
- Attention-based model for keyword spotting ☆20 Updated 3 years ago