jim-schwoebel / voicebook
π£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
β376Updated last year
Related projects β
Alternatives and complementary repositories for voicebook
- feature extraction from speech signalsβ354Updated last week
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β364Updated last month
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognitionβ469Updated 3 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β532Updated 2 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.β236Updated last year
- Speech noise reduction which was generated using existing post-production techniques implemented in Pythonβ176Updated 2 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ200Updated 3 years ago
- Utterance-level Aggregation For Speaker Recognition In The Wildβ365Updated last year
- π§ Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)β223Updated 4 years ago
- A library for speech data augmentation in time-domainβ643Updated 3 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorchβ210Updated 4 years ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language.β115Updated 5 months ago
- Voice Activity Detection based on Deep Learning & TensorFlowβ355Updated last year
- My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. β¦β297Updated 3 years ago
- spafe: Simplified Python Audio Features Extractionβ456Updated 4 months ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.β577Updated 2 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team atβ¦β367Updated 2 weeks ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting β¦β313Updated 11 months ago
- Tools for Speech Enhancement integrated with Kaldiβ398Updated last year
- Problem Agnostic Speech Encoderβ439Updated last year
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)β202Updated last year
- Voice Activity Detector in Pythonβ472Updated 3 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone dataβ95Updated last year
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systemsβ187Updated last year
- Speaker diarization scripts, based on AaltoASRβ190Updated 5 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networksβ428Updated 4 years ago
- π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).β1,727Updated 5 months ago
- An open source dataset for source separationβ378Updated 9 months ago
- SincNet is a neural architecture for efficiently processing raw audio samples.β1,137Updated 3 years ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasetsβ377Updated 5 years ago