jim-schwoebel / voicebook
π£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
β377Updated 2 years ago
Alternatives and similar repositories for voicebook:
Users that are interested in voicebook are comparing it to the libraries listed below
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β535Updated 2 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β365Updated last month
- A library for speech data augmentation in time-domainβ654Updated 3 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.β240Updated 2 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Pythonβ176Updated 3 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ202Updated 3 years ago
- Utterance-level Aggregation For Speaker Recognition In The Wildβ367Updated last year
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binderβ127Updated 3 years ago
- feature extraction from speech signalsβ364Updated 2 weeks ago
- A collection of Audio and Speech pre-trained models.β183Updated 4 years ago
- My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. β¦β303Updated 3 years ago
- Voice Activity Detector in Pythonβ472Updated 4 years ago
- DeepSpeech based forced alignment toolβ234Updated 4 years ago
- A statistical model-based Voice Activity Detectionβ191Updated 6 years ago
- Machine Learning applied to soundβ251Updated 8 months ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone dataβ96Updated last year
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognitionβ476Updated 3 years ago
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.β308Updated 3 years ago
- Speaker diarization python system based on binary key speaker modellingβ61Updated 3 years ago
- Deep neural network based speech enhancement toolkitβ212Updated 5 years ago
- Tools for Speech Enhancement integrated with Kaldiβ405Updated last year
- Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features libraryβ207Updated 4 years ago
- Speech Denoising with Deep Feature Lossesβ186Updated 4 years ago
- Python library for handling audio datasets.β136Updated last year
- Problem Agnostic Speech Encoderβ440Updated last year
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting β¦β320Updated last year
- Deep Neural Network for Speaker Count Estimationβ146Updated 4 years ago
- Voice Activity Detection (VAD) using deep learning.β193Updated 5 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networksβ429Updated 4 years ago
- Different implementations of "Weighted Prediction Error" for speech dereverberationβ498Updated 4 months ago