jim-schwoebel / voicebook
π£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
β380Updated 2 years ago
Alternatives and similar repositories for voicebook:
Users that are interested in voicebook are comparing it to the libraries listed below
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.β243Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β535Updated 3 years ago
- My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. β¦β307Updated 3 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ203Updated 3 years ago
- feature extraction from speech signalsβ367Updated this week
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,308Updated 8 months ago
- A neural attention model for speech command recognitionβ183Updated last year
- Voice Activity Detection based on Deep Learning & TensorFlowβ358Updated last year
- Utterance-level Aggregation For Speaker Recognition In The Wildβ367Updated last year
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β366Updated 2 months ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting β¦β320Updated last year
- π§ Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)β224Updated 4 years ago
- Problem Agnostic Speech Encoderβ440Updated last year
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorchβ211Updated 4 years ago
- Open tools and data for cloudless automatic speech recognitionβ447Updated 3 years ago
- Speech Denoising with Deep Feature Lossesβ186Updated 4 years ago
- A library for speech data augmentation in time-domainβ655Updated 3 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognitionβ478Updated 3 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Pythonβ176Updated 3 years ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender β¦β778Updated last month
- DeepSpeech based forced alignment toolβ237Updated 4 years ago
- Speaker embedding (d-vector) trained with GE2E lossβ276Updated last year
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binderβ128Updated 3 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.β376Updated last year
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systemsβ201Updated this week
- [deprecated] Pretrained models for pyannote-audio 1.xβ72Updated 2 years ago
- Speech Enhancement Generative Adversarial Network in PyTorchβ386Updated last year
- Segment an audio file and obtain utterance alignments. (Python package)β328Updated 9 months ago
- Tools for Speech Enhancement integrated with Kaldiβ409Updated last year
- Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a β¦β245Updated last year