jim-schwoebel / voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
★376 Updated last year
Related projects
Alternatives and complementary repositories for voicebook
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ ) ★532 Updated 2 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech. ★237 Updated last year
- My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. … ★298 Updated 3 years ago
- feature extraction from speech signals ★355 Updated 2 weeks ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python ★177 Updated 3 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc… ★364 Updated last month
- A list of publicly available audio data that anyone can download for ASR or other speech activities ★200 Updated 3 years ago
- DeepSpeech based forced alignment tool ★235 Updated 3 years ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting … ★313 Updated 11 months ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets ★379 Updated 5 years ago
- Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks ★429 Updated 4 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow ★355 Updated last year
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition ★469 Updated 3 years ago
- A neural attention model for speech command recognition ★180 Updated last year
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR ★908 Updated last year
- A collection of Audio and Speech pre-trained models. ★183 Updated 4 years ago
- Deep neural network based speech enhancement toolkit ★211 Updated 5 years ago
- A library for speech data augmentation in time-domain ★647 Updated 3 years ago
- Tools for Speech Enhancement integrated with Kaldi ★399 Updated last year
- Utterance-level Aggregation For Speaker Recognition In The Wild ★365 Updated last year
- Different implementations of "Weighted Prediction Error" for speech dereverberation ★494 Updated 2 months ago
- Deep learning based speech source separation using Pytorch ★312 Updated 4 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1 ★110 Updated 5 years ago
- Identifying people from small audio fragments ★169 Updated 4 years ago
- Python implementation of the Short Term Objective Intelligibility measure ★327 Updated 10 months ago
- Variational Bayes HMM over x-vectors diarization ★254 Updated 10 months ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ★95 Updated last year
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc… ★315 Updated 4 years ago
- Speech Enhancement Generative Adversarial Network in PyTorch ★379 Updated last year
- Speaker embedding (d-vector) trained with GE2E loss ★273 Updated 10 months ago